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PROJECT  DESCRIPTION 


INTRODUCTION 


The  research  on  this  project  centered  around  the  analysis, 
development,  and  implementation  of ^algorithms  for  representing  a 
surface  which  blends  together  two  or  more  intersecting 
surfaces.  The  blending  surface  should  provide  a  smoother 
transition  than  is  available  when  simply  considering  the 
intersection  of  the  surfaces.  The  algorithms  developed  must  be 
efficient  because  the  major  portion  of  computing  time  in  a  solid 
modeling  system  should  be  devoted  to  the  important  problems  of 
data  representation,  object  display,  surface  analysis  (ray 
tracing,  shadowing,  etc.)  and  scene  analysis  (hidden 
line/surface  removal,  etc.)  The  important  features  of  a 
blending  surface  are  speed  of  computation,  ability  to  be 
incorporated  into  the  other  features  of  a  solid  modeling  system 
(such  as  hidden  line/surface  removal),  and  the  visual  quality  of 
the  blended  image. ' 


‘‘■'This  report  is  organized  as  follows.  Section  1  is  an 
introduction.  Section  2  briefly  describes  some  basic  concepts 
from  solid  modeling.  Section  3  is  a  description  of  the 
organization  of  the  research,  major  findings,  and  publications. 
Section  4  describes  additional  directions  for  research. 
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2. 


SOME  CONCEPTS  FROM  SOLID  MODELING 


Computer  aided  design  (CAD)  systems  depend  upon 
representations  of  solid  objects  which  use  an  abstract, 
geometrical  model  rather  than  simply  representing  them  as 
collections  of  pixels  displayed  on  a  CRT  screen.  There  are 
several  common  methods  for  modeling  geometric  information,  each 
with  its  own  advantages  and  disadvantages.  The  most  common 
methods  are  the  boundary  representation  in  which  an  object  is 
described  by  its  boundary;  constructive  solid  geometry,  in  which 
an  object  is  described  by  its  boundary;  constructive  solid 
geometry,  in  which  an  object  is  described  by  an  algorithm  for 
constructing  it  from  geometric  premitives  such  as  speheres  or 
boxes;  and  a  data-str uctured  oriented  method  such  as  octrees,  in 
which  an  object  is  described  by  the  portions  of  space  it 
occupies . 

Each  of  these  methods  admits  additional  refinements.  For 
example,  the  boundary  representation  of  a  single  surface  may  be 
given  by  the  implicit  representation  F  (x,y,z)»0,  the  explicit 
representation  of  the  form  z«f(x,y)  which  is  obtained  by  solving 
the  implicit  equation  (where  possible  and  where  a  unique 
solution  exists), and  the  explicit  parametric  representation 
where  x,y,  and  z  are  described  by  a  set  of  parametric  equations. 

The  implicit  representation  is  convenient  for  finding  the 
intersection  of  two  surfaces.  However,  the  other  methods  are 
often  more  convenient  for  actually  graphing  objects. 

Research  during  the  project  period  has  centered  on  the 
boundary  representation  method  using  all  three  of  the  techniques 
of  implicit  and  explicit  surface  description. 


3.  SUMMARY  OF  RESEARCH  FINDINGS 


The  research  on  this  problem  has  taken  two  directions: 
efficiency  of  computational  algorithms  and 
development/implementation  of  mathematical  models. 

The  research  on  the  efficiency  of  computational  algorithms 
has  reinforced  a  well-known  phenomenon  in  computer  graphics  - 
that  very  high  speed  cpu's,  parallel/distributed  processing,  or 
specialized  graphics  hardware  are  frequently  necessary  for 
responsive  systems.  In  the  paper  "Evaluating  the  Performance  of 
a  User  Interface"  (Computes  and  Graphics  volume  11  no.  2  (1987), 
141-146),  algorithms  for  evaluating  performance  of  display 
systems  (especially  window  display  and  menu  selection  systems) 
were  given.  These  algorithms  are  appropriate  for  general 
systems,  with  no  particular  emphasis  on  solid  modeling  systems. 

The  primary  focus  of  "Complexity  of  Computer  Algorithms" 
(Rocky  Mountain  J.  Math  volume  17  (1987),  167-187)  was  general 
computer  algorithms.  However,  particular  algorithms  for 
polynomial  evaluation  via  look-up  tables  and  Tchebycheff 
polynomials  were  also  presented*  polynomial  evaluation 
algorithms  are  necessary  for  any  method  of  graphical 
representation  using  bicubic  or  similar  patches.  This  work  has 
naturally  lead  to  the  study  of  special  purpose  parallel 
algorithms  for  fast  polynomial  evaluation: 

Publications  in  the  area  of  parallel  algorithms  relevant  to 
computer  graphics  include  "Ada  Software  Metric  and  Their 
Limitations"  (Proc.  Joint  Ada  Conference,  Washington,  DC.  (March 
1987),  285-293)  in  which  formal  measuremnts  of  software 
complexity  were  made,  "Use  of  Concurrent  Tasking  Paradigms  for 
the  Design  of  Ada  Programs" (Proc  6th  Annual  Conference  on  Ada 
Technology,  Washington,  D.C.  (March  1988),  153-156)  in  which 
formal  models  of  concurrent/parallel  programs  were  used,  and 
"Acutal  Complexity  of  Parallel  Evaluation  of  Low  Degree 
Polynomials"  (being  reviewed  for  publication)  in  which  several 
algorithms  are  evaluated.  The  first  two  papers  mentioned  in 
this  paragraph  consider  high-level  algorithms  while  the  third 
paper  is  concerned  with  low  level  computations  including  counts 
of  memory  accesses  and  inter-process  communications.  An 
important  result  in  that  paper  is  that  "efficiency"  larger  than 
1  is  possible  for  evaluation  of  cubic  polynomials  at  multiple 
points.  The  efficiency  is  defined  here  as 
N*T (N)/T(l) 

where  T(l)  represents  the  time  if  one  processor  is  used  and  T(N) 
represents  the  time  if  N  processors  are  used.  This  high 
efficiency  is  based  on  a  pipelined  architecture  based  on 
deCastle jan ’ s  algorithm. 

Comparisons  between  this  specialized  architecture  and 
several  algorithms  such  as  Knuth's,  Horner’s  and  a  finite 
difference  method  are  also  given;  none  of  the  sequential, 
non-pipelined  algorithms  achieve  efficiency  of  1. 


An  offshot  of  some  of  this  work  was  two  papers  primarily  on 
education,  "Experiences  Teaching  Concurrency  in  Ada"  (AdaLetters 
vol.  7  no.  2  (1987),  40-41)  and  "A  Suggested  Topic  for  the  First 
Course  in  Computer  Science"  (SIGCSE  Bulletin,  vol  30,  no.  2 
(1988),  40-43). 


The  other  research  direction  that  was  considered  involved 
the  development/implementation  of  mathematical  algorithms  for 
blending  surfaces.  The  goal  of  this  research  was  to  allow  the 
inclusion  of  geometric  information  into  blending  surfaces.  the 
initial  geometric  information  used  was  the  minimization  of  the 
surface  area  of  the  blending  surface. 


The  first  technique  used  was  based  on  the  observation  that 
any  such  blending  surface  which  minimizes  surface  area  must  be  a 
minimal  surface  area  must  be  a  minimal  surface.  Minimal 
surfaces  are  surfaces  which  satisfy  the  non-linear  partial 
differential  equation 
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(assurming  z  is  a  function  of  x  and  y).  For  computational 
purposes,  the  most  appealing  minimal  surfaces  for  blending 
purposes  are  those  of  low  degree.  Therefore  the  first  objective 
in  this  area  of  research  was  to  obtain  minimal  surfaces  of  low 
degree . 


Surfaces  of  degree  3  or  4  hae  been  classified  by  Salmon  as 
falling  into  one  of  several  classifications.  The  determination 
of  which  of  these  low  degree  surfaces,  if  any,  is  a  minimal 
surface  is  a  formidable  computational  problem.  After  applying 
various  simplifying  transformations  to  the  equations  of  the 
surfaces,  the  equations  were  used  as  input  to  the  symbolic 
manipulation  package  MACSYMA  for  determining  if  the  surface 
satisfied  the  minimal  surface  equation.  Results  were  obtained 
for  a  few  of  the  possible  categories  of  surfaces;  however, 
problems  with  the  disk  drives  of  the  computer  prevented  a 
complete  solution  of  the  problem.  (The  computer  hardware 
problems  have  been  documented  in  the  most  recent  interim  project 
report.  The  disk  drive  problems  generally  made  the  use  of  large 
virtual  memory  space  impossible  and  thus  only  incomplete  results 
could  be  obtained  because  of  the  large  intermediate  size  of 
algebraic  expressions  given  by  MACSYMA  before  simplification). 

Since  only  incomplete  results  were  obtained  in  the  search 
for  low  degree  minimal  surfaces,  with  the  implicity 
representation,  the  project  now  considered  the  use  of  explicit 
parametric  representation.  A  classical  result  of  Weierstrass 
was  used  as  the  starting  point  -  parametr izations  of  minimal 
surfaces  arise  from  integrating  from  0  to  z  the  functions 
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where  g  and  w  are  analytic  functions  on  some  domain.  The 
parameters  are  the  real  and  imaginary  parts  of  the  independent 
variable  z.  An  important  contribution  here  was  the  observation 
that  these  analytic  functions  have  additional  degrees  of  freedom 
which  provide  6  additional  degrees  of  freedom  in  the  simplest 
known  surfaces  (the  Enneper's  surface)  and  more  for  surfaces  of 
higher  algebraic  degree.  This  research  is  incorported  in  the 
papers  "Minimal  Blending  Surfaces"  and  "Geometric  Considerations 
in  Blending  Surfaces".  Both  of  these  papers  have  been  submitted 
for  publication  and  are  still  in  the  reviewing  process.  Each  of 
these  papers  indicate  the  use  of  these  minimal  surfaces  as 
blending  surfaces. 

The  papers  have  concentrated  on  simple  problems  in  blending 
surfaces  which  are  sections  of  minimal  surfaces.  The  additional 
degrees  of  freedom  mentioned  above  are  used  for  curve  fitting 
and  for  visual  appeal. 


4.  FUTURE  DIRECTIONS 


Current  work  is  directed  towards  using  other  geometrical 
constraints  such  as  minimizing  additional  volume  due  to  blendin 
surfaces  (with  some  tangency  information  added)  and  towards 
using  methods  of  the  calculus  of  variations  for  problems  posed 
in  terms  of  surfaces  which  are  described  parametrically.  The 
fundamental  idea  is  to  use  a  Ritz  method  to  find  good 
approximate  solutions  which  are  in  the  class  of  low  degree 
surfaces.  It  is  expected  that  results  will  be  submitted  for 
publication  by  the  end  of  1988. 

Additional  work  on  the  determination  which  of  the  surfaces 
of  degree  3  or  4  as  minimal  surfaces  will  be  postponed  until 
sufficient  reliable  disk  drive  capacity  becomes  available. 
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ABSTRACT 

A  eajor  goal  of  software 
engineering  research  la  the 
developaent  of  aetrics  which  aeasure 
the  complexity  end  aslntalnabillty  of 
prograas,  with  a  saall  portion  of  this 
effort  directed  specifically  towarda 
prograas  written  in  Ada.  This  paper 
will  foens  on  two  aaln  theaea.  The 
first  these  will  be  the  development  of 
aetrics  that  specifically  reflect  the 
coaplexity  of  prograas  In  Ada.  The 
second  theae  will  be  an  investigation 
of  the  theoretical  Halts  of  aetrics 
as  measures  of  program  coaplexity  In 
general. 


1.  INTRODUCTION 

There  has  been  a  considerable 
aaount  of  research  activity  directed 
towards  the  developaent  of 
aeasureaents  of  coaplexity  of  prograas 
which  have  high  correlation  with 
prograaaing  effort.  See  for  example 
references  [I],  [4],  [5],  [6],  [7). 
[8],  [9],  [11],  [13].  Much  of  the 
work  aay  be  broadly  classified  into 
three  categories,  each  related  to  a 
aodel  of  prograaaing  coaplexity. 

The  first  aodel  aasuaes  that  the 
prograaaing  coaplexity  is  the  sua  of 
the  prograaaing  complexities  of  the 
various  aodules  asking  up  the 
program.  A  typical  exaaple  of  this 
research  [4]  uses  aeasurea  such  as  the 
nuaber  of  operators  and  operands, 
nuaber  of  distinct  operators  and 
operands,  etc.  No  special  use  is  aade 
of  control  structures;  for  exaaple,  a 
GOTO  statement  is  treated  as  having  an 
operator  and  operand.  Another  exaaple 
of  this  type  is  the  cycloaatic  aeasure 
of  McCabe  [8].  Here  the  branchng  and 
flow  of  control  is  considered,  with 
the  major  concern  being  computation  of 


a  nuaber  called  the  cycloaatic 
coaplexity  which  is  basically  Euler's 
foraula  applied  to  a  graph  which 
represents  the  prograa.  Kafura  and 
Henry  [5]  call  these  aetrics 
alcrolevel . 

The  second  aodel  aseuaes  that  the 
particular  aodules  coaprislng  a 
prograa  are  relatively  straightforward 
and  that  the  major  factors  in  prograa 
coaplexity  are  the  interconnections 
between  these  aodules.  A  primary 
exaaple  of  this  type  of  research  is 

[5]  and  the  references  indicated 
there. 

The  third  aodel  [3]  aseuaes  that  a 
primary  factor  in  prograa  coaplexity 
is  the  experience  of  the  programmer 
with  other  factors  such  as  the  goals 
of  prograa  efficiency  or  storage 
constraints  having  some  effect  on  the 
coaplexity . 

Very  few  researchers  consider  more 
than  one  of  these  models;  Kearney,  et  al 

[6]  is  an  exception. 

We  note  that  auch  of  the  literature 
on  software  aetrics  is  concerned  with 
the  coding  phase  of  software 
developaent.  Few  articles  explicitly 
address  the  points  aade  by  Carrio  [3], 
Raaaaoorthy  [12]  and  aany  others  that 
aaintenance  is  a  major  portion  of  the 
software  life  cycle.  Carrio  states  that 
nany  of  the  changes  are  caused  by  what 
he  calls  "pseudo-maintenance"  activities 
which  change  the  scope  of  the  project  by 
adding  features  or  changing  requirement 
specifications.  Raaaaoorthy  describes  a 
saaple  of  282  programs  with  failures  in 
which  38. 2Z  are  caused  by  problems  in 
the  requireaents/specif ication  level. 

One  of  the  claims  made  for  Ada  is 
that  it  will  reduce  the  total  aaount  c 
coaplexity  of  software  in  a  particuic; 
installation  by  encouraging  reusable 
code  and  to  soae  degree  by  acting  at 
least  in  part  as  a  program  design 
language.  In  the  conclusion  of  this 
paper  we  will  make  some  observations 
about  these  claims. 
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In  chin  paper  we  consider  a  variety 
of  software  aetrics  which  are  applied 
to  over  30  prograaa  written  in  the 
language  Ada.  Tha  paper  is  orgsnizsd 
ss  follows.  In  section  2  ws  describe 
the  results  obtained  by  applying 
savaral  aetrics  such  as  in  [1],[4], 
13],  [7],  [8].  In  section  3  wa 
consider  the  seas  data  using  soaa  new 
aetrics. 

Sections  4  and  3  are  concarnsd  with 
liaitatlons  of  software  aetrics  for  Ada 
progress.  Section  4  introduces  the 
concept  of  a  tine-varying  aetric.  Such 
aetrics  have  the  goal  of  aasauring  that 
portion  of  a  prograa  aost  likely  to 
. change  because  of  a  change  in  prograa 
specif icationa.  The  aetrics  are  also 
evaluated  on  tha  seas  data  sat. 
Relevance  of  such  aetrics  and  tha 
associatad  statistics  to  tha  software 
life  cycle  la  discussed. 

Section  3  of  tha  paper  includes  a 
discussion  of  Inherent  coaplaslty  of 
progress  and  tha  resulting  liaitatlons 
of  software  aetrics. 

Section  6  provides  a  suaaary  of  tha 
results  obtained  and  soae  suggestions 
for  future  work. 


2.  APPLICATION  OF  EXISTING  METRICS 

This  paper  has  the  twin  goals  of 
developing  software  aetrics  for  Ada 
prograaa-  and  describing  the  liaitatlons 
of  such  aetrics.  In  this  section  we 
discuss  the  developaent  of  aetrics. 

When  developing  new  aetrics,  it  is 
instructive  to  examine  the  behavior  of 
soae  of  the  classical  aetrics  on  a  set 
of  saaple  prograaa.  The  saaple 
prograaa  were  selected  at  randoa  froa  a 
variety  of  textbooks  on  Ada;  they  range 
in  length  froa  33  to  236  lines  of 
code.  He  eaphasize  that  this  is  not  a 
foraal  experiaent.  Instead,  we 
consider  the  exsaples  given  here  as 
providing  experience  in  the  collection 
of  data  for  the  developaent  of  aore 
coapleta  aetrlca  such  as  those 
considered  in  the  next  section. 

With  these  csveats  in  alnd  we 
present  the  dntn  in  Tnble  1  for  these 
progrnas.  We  nee  the  notion  of 
operator  and  op«iand  aa  described  in 
[4],  The  correlation  between  the 
Halstead  and  McCabe  aetrics  for  these 
progress  is  a  low  0.139  explaining  only 
37Z  of  the  variance.  A  graphical  view 
of  these  aetrics  is  given  in  figure  1 
at  the  end  ofa  the  paper  where  they  ere 
coapared  with* a  new  Ada  aetric. 


Table  1 


Prograa 

Halstead 

McCabe 

Lines 

1 

7611 

5 

33 

2 

5148 

6 

51 

3 

13485 

4 

55 

4 

215983 

8 

111 

5 

46189 

8 

115 

6 

42169 

5 

79 

7 

32733 

4 

42 

8 

73171 

7 

72 

9 

13537 

14 

91 

10 

35235 

7 

74 

11 

19162 

8 

58 

12 

12832 

2 

45 

13 

5923 

3 

39 

14 

16476 

5 

67 

15 

47071 

2 

81 

16 

37405 

5 

50 

17 

2482 

3 

36 

18 

19826 

4 

68 

19 

2145 

3 

41 

20 

1572 

3 

65 

21 

3381 

1 

32  • 

22 

10806 

3 

39 

23 

160794 

3 

91 

24 

100991 

12 

92 

23 

47611 

6 

72 

26 

44781 

7 

66 

27 

14136 

4 

68 

28 

6441 

4 

48 

29 

64893 

24 

236 

30 

5159 

3 

50 

3.  ADA-SPECIFIC  METRICS 

In  the  previous  section  we  saw  that 
for  a  large  collection  of  Ada  progress, 
there  is  low  correlation  between  the 
frequently  used  Halstead  and  McCabe 
aetrics.  Clearly  neither  aetric 
coapletely  neaaures  software 
coaplexity.  Thus  we  need  to  exaaine 
aetrics  providing  "orthogonal"  views  of 
a  prograa.  Such  aetrics  will  use  soae 
of  the  ideas  of  the  interconnection 
ideas  of  [5]  as  well  ss  [7]. 

We  consider  a  collection  of  Ada 
language  features  whose  presence  aay 
explain  the  wide  variation  between  the 
various  aetrics.  Thesa  features  are 
grouped  by  their  perceived  effects  on 
language  level,  prograaaers'  specific 
abilities,  portability,  and 
verifiability.  We  consider  only  those 
features  that  are  specific  to  Ada  and 
not  available  in  other  languages  such  as 
Pascal.  The  reason  for  this  is  that  an 
Ada  prograa  written  Only  using 
Pascal-like  features  can  have  its 
software  quality  aeasureed  by  obvious 
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translation*  of  Pascal  aetrlcs  (assualng 
that  chars  ara  adequate  aatrlcs  for 
Pascal  prograas). 

A.  FEATURES  DUE  TO  LANGUAGE  LEVEL. 

1.  Naas  Equivalence 

For  exaaple,  tho  daclaratlons 

A.B:  INTEGER; 

C:  INTEGER; 

Type  DATATYPE  la  saw  INTEGER; 

D:  DATATYPE; 

Type  INT  TYPE  is  saw  INTEGER; 

E:  INT_TtPE; 

allow  A.B,  or  C  to  ha  considered  tha 
saaa  typa.  bat  D  is  consldarad  a 
diffaraat  typa.  This  has  so  affact  os 
McCabe' a  aatrlcs.  There  is  no  affect 
os  Halstead's  if  we  consider  only 
executable  stataaants  and  increasa  tha 
nnabar  of  lines  of  coda  by  A. 

Tha  variables  A.B, and  C  ara  all 
declared  as  being  of  typa  INTEGER. 
Writing  tha  declaration  in  this  fora 
require  aore  source  code  text  than  does 

A.B.C:  INTEGER 

However  if  it  is  aceoapanled  by  a 
eoaaent  explaining  the  significance  of 
the  variable  C,  then  the  effect  of  this 
longer  fora  of  the  declaration  is  to 
slightly  increase  readability  of  the 
code  and  hopefully  to  slightly  reduce 
prograa  coaplexlty. 

Tho  declaration  of  D  as  being  a 
distinct  type  (called  a  derived  type  in 
Ada)  allows  storage  of  D  and  allowa 
aany  operations  to  be  perforaed  on  D. 
However,  it  does  not  allow  operation? 
such  as  the  addition  of  a  variable  to 
type  INT_TTPE  to  a  variable  of  type 
DATATYPE.  This  actually  reduces  the 
coaplexlty  of  the  prograa  since  it 
precludes  "accidental"  errors  such  as 
adding  a  Zip  Code  to  a  Social  Security 
nuaber  and  expecting  e  sensible  result. 

Note  that  this  phenoaenon  does  not 
occur  in  a  language  with  only 
"structure  equivalence"  of  naaee.  In 
such  languages,  addition  of  A  and  D  is 
a  legltlaate  operation. 

2.  Generics 

Generic  packages  provide  an 
opportunity  for  data  abstraction.  As 
such,  they  represent  an  opportunity  for 
the  prograa  to  represent  an  algoritha 
aore  dearly.  Thus  tha  affect  of 
"generics"  will  be  to  reduce  the 
coaplexlty  of  the  software  during  the 
design  and  coding  phase  since  aaong 
other  things  they  reduce  the  nuaber  of 
subroutines  and  lines  of  code  of  the 
prograa.  However,  aa  was  pointed  out  in 
[1],  generics  fora  a  teaplate  whose 


correctness  in  a  particular  prograa  is 
difficult  to  test  without  exhaustive 
consideration  of  all  cases  of 
instantiations. 

* 

B.  FEATURES  WHICH  INFLUENCE  PROGRAMMER 
UNDERSTANDING. 

1.  Tasking 

The  ability  to  specify  operations 
which  need  not  be  executed  sequentially 
is  one  fo  the  aajor  features  of  Ada. 

The  writing  and  debugging  of  prograas 
involving  tasks  is  coaplicated  by  the 
fact  that  soae  errors  will  becoae  known 
only  when  certain  orders  of  stateaent 
execution  ere  followed  and  these 
particular  orders  often  occur  long 
after  the  testing  phase. 

Tasking  is  also  one  of  the  few 
factors  present  in  all  phases  of  the 
prograa  life  cycle  froa  apeclf icatlon 
to  aaintenance.  Therefore  it  aust  be 
included  in  aetrlcs  which  are  to  be 
applied  at  various  tiaes  during  the 
life  cycle.  In  addition,  tasking  used 
to  iaprove  perforaance  by  splitting 
execution  of  processes  onto  aany 
processors  should  probably  change  when 
the  nuaber  of  proceasors  available  in 
any  lapleaentation  of  the  software 
increases.  This  facet  of  tasking 
therefore  will  also  affect  the 
portability  of  code  soaewhere  during 
the  life  cycle. 

Because  of  the  coaplexlty 
introduced  by  tasking,  we  aust  treat 
arrays  of  tasks  separately  froa  the  way 
we  treat  arrays  of  data  objects. 
Declaration  of  an  array  of  data  objects 
does  not  change  any  aeasureaent  of 
software  based  only  on  executable 
stateaents.  An  array  of  tasks  provides 
far  sore  opportunity  for  errors  in 
interconnection  between  two  tasks  than 
do  one  or  two  tasks.  Thus  the  nuaber 
of  eleaents  in  an  array  of  tasks  has  a 
great  affect  on  any  reasonable  Ada 
software  aetric. 

An  even  worse  situation  is  caused 
by  the  Ada  language  allowing  the 
creation  of  pointers  to  tasks.  Each 
additional  task  increases  software 
coaplexlty.  However,  the  nuaber  of 
tasks  cannot  be  deterained  until  after 
execution  of  the  prograa.  We  return  to 
this  point  later. 

2.  Subfeatures  of  tasking 

Many  of  the  probleas  occuring  in 

software  which  allows  concurrency  are 
caused  by  synchronization  of 
processes.  With  this  in  mind  we 
observe  chat  the  reserved  words 
"select",  "accept",  "entry",  "delay". 
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"abort"  in  tha  context  of  tasks  aust 
increase  the  complexity  of  the  software 
(ond  therefore  any  coaplexity  metric). 

3.  Private  declarations 

Declarations  using  the  work 
"private"  tend  to  reduce  coaplexity  in 
interconnection  aetrlcs  since  they 
alnialze  tha  Interface  between 
eoaponents  of  the  software.  "Limited 
private"  declarations  further  restrict 
the  interface.  Tha  presence  of  such 
declarations  reduces  coaplexity. 

4.  Node  restriction 

Restricting  the  mode  of  parameters 
to  be  "in",  "out",  or  "in  out"  in 
procedures  and  "in”  only  in  functions 
reduces  side  effects  and  thus  reduces 
coaplexity. 

5.  Exception  handling. 

The  factors  that  causa  exceptions 
are  present  in  every  substantial 
software  project.  Exception  handling 
facilities  in  Ada  provide  a  clean  way 
of  creating  exceptions.  The  effect  on 
software  aetrlcs  la  to  increase  the 
lines  of  code,  operators  and  operands 
and  thus  Increase  coaplexity.  However, 
exception  handling  is  probably  the 
simplest  way  to  write  certain  segaenta 
of  code  and  thus  the  effect  on  a  aetrlc 
should  be  relatively  minor. 

C.  FEATURES  WHICH  INFLUENCE 
PORTABILITY. 

1.  Packages 

Packages  encourage  nodularity  which 
is  of  vital  iaportance  in  structured 
design.  Clearly  a  piece  of  software 
can  be  ported  to  another  installation 
only,  if  all  of  the  packages  called  by 
the  aoftware  are  also  ported  and  there 
are  no  naae  conflicts  with  existing 
packages  in  th«  new  installation. 

2.  Generics 

Generic  packages  encourage 
portability  by  requiring  only 
instantiation  to  work.  However, 
considerable  care  aust  be  given  to  the 
testing  of  generic  packages,  since  each 
instantiation  of  a  package  for  a  new 
data  type  ahould  be  tested  like  a  new 
package. 


3. Interfaces  to  other  languages 

Dots  abstraction  and  information 
hiding  are  najor  features  of  Ada. 
Clearly  any  software  interface  to 
another  language  decreases  the 
usefulness  of  these  features.  Consider 
the  following  example  froa  the  Ada 
Reference  Manual  [10,  p.  217]: 

package  FORT-LIB  la 

function  SQRT  (Xj  FLOAT)  return  FLOAT; 
function  EXP  (X: FLOAT)  return  FLOAT; 
private 

pragaa  INTERFACE  (FORTRAN,  SQRT); 
pragma  INTERFACE  (FORTRAN,  EXP); 
end  FORT-LIB; 

This  use  of  the  pragaa  INTERFACE 
decreases  portability  because  it  uses 
languages  that  aay  not  be  available  in 
all  installations  and  becasua  the 
capability  for  interfaces  to  other 
languages  need  not  be  aade  available  in 
all  implementations  of  Ada  [10,  p. 

217].  Clearly,  this  increases 
coaplexity . 

4.  Machine  code  insertions 

Clearly  machine  code  in  an  Ada 
program  eliminates  portability  of  that 
section  of  the  program  that  inlcudes 
the  machine  code.  Some  machine  code 
can  be  reorganized  by  the  presence  of 
the  pragma  INLINE  and  the  use  of  the 
predefined  library  package  MACHINE 
CODE. 

Another  source  of  machine  dependent 
code  is  he  use  of  specific  locations. 
Examples  of  these  are  the  for-use, 
at-mod  and  use-at  constructions. 
Examples  are: 
for  RIGHT_MASK  use  S#001# 

(using  a  bit  pattern  to  mask  input 

for  example) 

FOR_FAILURE_SIGNAL  use  at  814041# 

(using  an  octal  representation 
of  a  port) 

and 

for  PIXEL_ST0RAGE  use 
record  at  mod  4; 

X  at  SOME  X  VALUE; 

Y  at  S0ME~Y”VALUE; 

COLOR  at  SOME  COLOR; 

INTENSITY  at  S0ME_LEVEL; 
end  record; 

This  last  example  might  be  used  in  a 
graphics  program  in  which  we  intend  to 
move  a  block  of  pixels  and  wish  to 
speed  up  their  movement  by  aligning 
with  byte  or  word  boundaries. 


288  Joint  Ada  Conference  1887 


D.  FEATURES  WHICH  INFLUENCE 
VERIFIABILITY 

1.  Naaed  paraaeter  association 

Consider  a  generic  package 
generic 

X.Y:  FLOAT:  -0.0; 
package  POINT  ia  ... 
package  FIRST  POINT  is  new  POINT 
(3. 7. 2.1); 

package  SECOND  POINT  is  new  POINT 
(X  ->  3.7), 

T  ->  2.6); 

In  the  package  SECOND^ POINT,  the  naaed 
paraaeter  association  telle  us  about 
the  naaes  of  the  paraaeters  as  wall  as 

their  values.  The  other  package 
describes  the  values  by  using 
positional  notation.  Naaed  paraaeter 
association  tends  to  decrease 
coaplexlty  when  used  both  in  this 
context  and  in  the  context  of  fields  of 
a  record. 

2.  Global  variables. 

Use  of  such  variables  often 
increases  coaplexlty  because  the 
availability  of  a  shared  variable  to 
two  tasks  aeans  that  neither  can  asauae 
anything  about  the  order  in  which  the 
operations  of  the  various  tasks  are 
perforaed  except  at  synchronisation 
points.  The  syntax  for  shared 
varlablea  is 

pragaa  SHARED  (var_naae); 

Shered  varlablea  increase  coaplexlty 
since  they  Increase  opportunity  for 
errors. 

4  .  TIME  VARYING  METRICS 

In  the  previous  sections  we 
discussed  aoae  standard  aetrics  and  the 
results  of  their  application  to  a  large 
set  of  Ada  progress.  The  aetrics  used 
all  assign  a  coaplexlty  aeaeure  to 
prograaa  and  to  their  coaponent 
aodules,  with  the  priaary  purpose  being 
the  early  identification  of  those 
progress  or  aodulea  which  are  aoat 
likely  to  require  changes  in  the 
developaent  stage.  We  now  consider  the 
behavior  of  aetrics  when  they  are 
applied  to  progress  during  the  entire 
life  cycle  rather  than  restricting 
attention  to  the  developaent  pheae. 

Note  that  any  aetric  for  Ada  prograas 
auat  take  into  account  the  factors 
aentloned  in  section  3. 

Consider  the  standard  aodel  ox  the 
software  life  cycle.  The  "aaintennnce 
phase"  is  often  caused  by  either 
porting  code  to  another  aachinn  or  by 
changing  specifications  to  req><i.r-: 
faster  execution,  eore  efficient  use  of 


storage,  addition  or  delecton  of 
processors  in  a  distributed  aystea, 
demand  for  a  aore  "friendly"  user 
interface,  etc.  Changing  performance 
requireaent  specifications  are  even 
aore,  prevalent  in  a  aodel  in  which 
prototypes  are  developed  rapidly  with 
successive  aodification  of  the 
prototypes  leading  to  deliverable 
products.  The  evolution  of  Ada 
software  projects  is  greatly  Influenced 
by  factors  peculiar  to  the  Ada  ailieu. 
These  factors  are:  standardisation  of 
the  language  before  the  advent  of 
usable  coapilera,  rapidly  evolving 
coapller  perforaance  (although 
coapilation  speed  and  quality  of  code 
is  still  not  at  a  very  high  level 
coapared  to  aore  aature  languages),  and 
lack  of  experienced  Ada  prograaaers 
(because  of  the  newness  of  the 
language) . 

It  la  clear  that  aetrics  used  only 
during  the  coding  phase  are  only  an 
approxlaatlon  to  any  quantitative 
evaluation  of  the  software  that  will 
eventually  be  produced.  A  aetric 
applied  only  at  one  point  in  the  life 
cycle  can  only  suggest  portions  of  the 
code  that  are  especially  coaplex.  Such 
aetrics  are,  by  their  nature.  Incapable 
of  aeasuring  those  portions  of  the  code 
which  are  inherently  coaplex.  Note 
also  that  the  goals  of  such  aetrics  is 
to  aid  in  the  developaent  of  code  of 
ainiaal  coaplexlty.  regardless  of  the 
changing  requlreaents  during  the  useful 
life  of  the  software  project. 

The  nature  of  the  tlae  variation  of 
metrics  is  the  theme  of  this  section. 

We  intend  to  return  to  the  topic  of 
evaluating  and  fine  tuning  such  metrics 
in  a  future  paper. 

With  these  suggestions  in  mind  we 
will  use  the  following  terminology. 

MS  -  the  aetric  used  at  the 
specification  stage 

MD  ■  the  aetric  used  at  the 
design  stage 

MC  -  the  aetric  used  to  evaluate 
completed  code  (In  the  case  of  a 
developing  system  with  many  prototypes, 
the  aetric  may  be  applied  to  each 
prototype) . 

MM  ■  the  aetric  used  during  the 
aaincenance'  cycle. 

It  is  natural  to  ask  if  the  same 
aetric  can  be  used  at  each  one  of  these 
four  stages  in  the  development  of  Ada 
programs;  the  answer  is  a  resounding 
no!  Among  other  things,  Ada  is  not  a 
formal  specification  language  since 
specifications  cannot  be  executed. 
(Historical  note  -  executable 
specifications  were  considered  in 
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several  of  Che  interim  reports  on 
Ada).  Since  most  software  metrics 
assume  that  language  level  is  a  major 
factor  in  the  metric,  there  cannot'  be  a 
suitable  measure  MS  unless  the 
specification  language  la  fixed.  For 
simplicity,  we  assume  that  the 
specification  language  is  English  and 
that  the  metric  MS  is  simply  defined  by 

MS-  number  of  tesks. 

We  are  assuming  that  the  number  of 
tasks  is  known  and  fixed  at  this 
stage.  Any  dynemically  allocated  tasks 
are  assumed  to  be  created  because  of 
performsnce  or  coding  criteria  in  later 
stages  of  software  development. 

Thus  the  meesure  et  this  point  is 
basically  an  "interconnection  metric" 
since  it  is  besed  on  the  number  of 
separately  executing  components  of  the 
progrem. 

The  next  metric  MO  is  more 
interesting,  because  many  people 
consider  Ada  to  be  a  reasonable  design 
language. 

We  assume  thet  MO  is  a  metric  which 
should  eccurately  reflect  the  type  of 
code  which  will  be  written  during  the 
coding  phase.  Thus  MO  and  MC  should 
have  high  correlation.  In  this  paper 
we  assume  that  ND-NC. 

The  metric  MC  used  at  the  end  of 
the  coding  stage  (or  at  the  end  of  the 
coding  stage  for  each  prototype) 
reflects  the  standard  use  of  metrics. 
That  is,  MC  is  a  static  measure  of  the 
code.  Its  use  hare  is  to  analyze  why 
certain  segments  of  the  code  were  herd 
to  write  and  to  predict  and  avoid 
problems  which  will  occur  during  the 
maintenance  stage.  Our  empirical 
evidence  (see  the  graph  in  Figure  1) 
supports  the  metric  MC  defined  by 

MC  -  2  "NUMBER  OF  TASKS  +3  "NUMBER 
OF  EXCEPTIONS  +  4  "(NUMBER  OF  ENTRY  + 
NUMBER  OF  ACCEPT  +NUMBER  OF  SELECT)  + 
NUMBER  OF  SHARED  VARIABLES  + 
NUMBER-OF-PRIVAT¥  TYPES  - 
(NUMBER  OF  MODES  IN  +  NUMBER_OF_ 
MODES_IN_OUT  +  NUMBER  0F_M0DES_0UT)/2  . 

The  Tinal  metric  MM  is  easier  to 
understand.  Since  each  metric  has  the 
dual  goals  of  measuring  the  current 
product  and  predicting  problem  in  the 
future,  MM  is  concerned  only  with 
problems  in  fixing  errors,  improving 
performance,  adding  functionality,  and 
in  the  code  being  ported  to  other  host 
systems.  The  last  three  of  these 
reflect  changes  in  the  design 
requirements  of  the  software.  At  this 
time  we  suggest  the  equation 


MM  -  MC+NUMBER  OF  PACKAGES  + 

4  *  NUMBER  OF  TASKS-  +- 
*  5  ♦  (NUMBER  OF_FOR_USE)  + 

3  *  ( NUMBER- OF  BINARY  +  NUMBER  OF  OCTAL 
♦  NUMBERaOF—HEX I DECIMAL  )  ~  - 

+  numberjjf_at_mod 

for  the  metric  MM. 

Consider  for  example  the  problem  of 
porting  software  which  allows  a  maximum 
of  S  independent  taska  from  a  system 
with  1  processor  to  a  system  with  10 
processors.  It  is  likely  that  such  a 
wealth  of  resources  will  cause  a  major 
change  in  the  software  if  performance 
improvements  are  not  as  expected. 

5.  LIMITATIONS  OF  ADA  SOFTWARE  METRICS 

The  following  seems  to  be  the 
minimum  requirement  used  for  any 
software  metric. 

Definition.  A  aoftwere  metric  m 
for  a  language  L  is  a  function  from  the 
set  of  programs  or  modules  written  in 
the  language  L  to  the  non-negative  real 
numbers. 

Note  that  this  definition  says 
nothing  about  the  input  to  a  metric 
being  a  correct  program  or  module. 

Note  also  that  metrics  often  have 
additional  properties.  For  example, 
Halstead's  metric  [4]  has  the  property 
of  additivity;  that  is.  if  A  and  B  are 
disjoint  nodules,  then 

m(A+B)  -  m( A )  ♦  m(B), 
and  this  is  independent  of  how  A  and  B 
are  interconnected.  Here  "A+B"  means 
the  joining  of  A  and  B  in  a  single 
program. 

The  cyclomatic  number  e-n+p  defined 
by  McCabe  [8]  requires  a  slightly 
different  analysis.  If  we  assume  that 
the  start  mode  of  B  is  identified  with 
a  node  of  A  and  that  no  other  nodes  or 
edges  are  common,  then  the  number  of 
nodes  of  A+B  is  one  less  then  the  sum 
of  the  number  of  nodes  of  A  and  the 
number  of  nodes  of  B.  Using  an  obvious 
notation , 

n(A+B)  -  n(A)  +n(B)  -1. 

Also,  the  number  of  connected 
components  of.  A,B,  and  A+B  are  related 
by 

p(a+B)  -  p ( A )  +  p(B)  -1 
and  hence  we  have 

m(A+B)  -  m( A )  +m(B). 

It  is  easy  to  see  that  this  is  true 
even  if  A  and  B  have  other  nodes  or 
edges  in  common  or  if  A  and  B  have  no 
nodes  in  common.  In  any  event, 

McCabe's  cyclomatic  number  satisfies 
the  additivity  condition 
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a(A+B)  -  b(A)  *  b(B). 

What  about  interconnection 
aatrica?  Such  metrics  aaauaa  that  the 
priaary  factor  influencing  coaplexity 
ia  the  interconnectiona  between 
aodules.  The  nuaber  of  interconnections 
increases  exponentially  as  the  nuaber 
of  aodules  increases.  Hence  the 
additivity  condition  is  replaced  by  the 
condition 

a(A+B)>  a( A)  ♦  a(B) 
where  "A+B"  now  represents  a  prograa 
with  aodules  A  and  B  (which  of  course 
aay  also  be  coaposed  of  other 
nodules).  Thus  these  aetrics  fail  to 
hews  the  property  of  "subsdditlvlty" , 
auch  less  additivity.  This  is  a  aajor 
reason  for  the  lack  of  a  well-defined 
theory  of  such  aetrics  for  general  use 
in  evaluating  software  written  in  aost 
prograaalng  languages. 

The  situation  for  Ada  software 
aetrics  is  soaewhat  different  froa  the 
situation  for  general  prograaalng 
language  aetrics.  As  was  indicated  in 
section  3,  the  interfaces  between 
coaponent  aodules  ere  tightly 
controlled  by  the  standard  interfaces, 
such  as  restricting  aodes  in  functions 
to  "in",  "out",  or  "in  out".  For 
separate  packages,  there  are  no  coaaon 
variables  unless  the  coaaon  variables 
are  declared  by  the  pragaa  SHARE  in 
each  package.  Thus  .Ada  aetrics  when 
applied  to  structured  code  have  the 
additivity  property  that  the  aeasure  of 
a  prograa  coaposed  of  two  separate 
prograa  units  is  equal  to  the  sua  of' 
the  aeasures  of  the  two  prograa  units. 
Hence  it  is  reasonable  to  suspect  that 
there  is  soae  aatheaatlcsl  structure 
underlying  the  theory  of  Ada  software 
aetrics  of  the  type  presented  here. 

It  is  iaportant  to  distinguish  the 
factors  that  are  to  be  aessured  by  a 
aetrlc.  Such  factors  include  the 
language  level,  underlying  coaplexity 
of  the  problea,  experience  and  ability 
of  the  prdgraaaer,  relative  frequency 
of  expected  errors  in  certain  code 
segaents,  and  degree  of  difficulty  in 
iapleaentlng  the  changes  in  the  next 
phase  of  the  software  life  cycle. 
Metrics  are  applied  to  code  for  the 
purpose  of  evaluating  the  code  and 
predicting  those  portions  which  will  be 
troublesoae  in  the  next  phase  of  the 
life  cycle. 

With  these  points  in  Bind,  we  Bake 
the  following  definition.  We  define  an 
Ada  software  aetric  scheae  to  be  a 
quadruple  (as,  ad,  ac,  aa)  of  functions 
called  aetrics  whose  range  is  a  subset 
of  the  set  of  real  nuabera  and  which 
satisfy  the  following  conditions: 


a.  The  domain  of  as  .is  the  set  of 
all  possible  specification  language 
(which  aay  be  English,  formal 

specif icaiton  language  with  executable 
code,  or  soaething  in  between). 

b.  The  doaains  of  ad,  ac  and  aa 
ara  the  sat  of  all  possible  Ada 
prograas  (asauaing  Ada  is  the  design 
language) . 

c.  Each  of  the  metrics  ad,  ac,  and 
aa  is  the  sua  of  two  non-negative 
functions.  These  functions  are  the 
P-aeaeurea  which  represent  the  aetric 
applied  only  to  those  portione  of  the 
coda  that  are  direct  translations  of 
Pascal  prograas  and  the  A-functiona 
which  represent  the  coaplexity  of  the 
code  caused  by  Ada-specific  features 
not  present  in  the  language  Pascal. 

Defining  a  aetric  acheae  aa  a 

collection  of  four  aetrics  seeas 
soaewhat  redundant  at  first  glance. 

Much  of  the  research  in  software 
aetrics  involves  a  search  for  a  single 
aessureaent  to  be  used  at  all  tiaes. 
However,  Dunsaore  and  Cannon  [3] 
perforaed  an  interesting  experiaent  in 
the  use  of  global  variables  and  foraal 
paraaeters  in  coaaunicating  between 
varfoua  aodules.  They  observed  that 
global  variables  tend  to  decrease 
errors  during  prograa  developaent  but 
that  foraal  paraaeters  tend  to  decrease 
errors  during  the  aaintenance  phsse. 
Their  results  support  the  need  for 
different  aetrics  during  different 
phases  of  the  life  cycle. 

Recall  that  in  section  3  we 
observed  that  any  aetric  for  Ada 
programs  Bust  consider  the  typing, 
tasking,  packages,  aodes,  generics, 
exceptions  and  other  special  features 
of  Ada.  Clearly  these  aust  be  present 
in  the  A-function  since  they  are  not 
available  in  the  language  Pascal.  We 
are  now  ready  to  state  and  prove  the 
aajor  result  which  shows  the- liaitation 
of  Ada  software  aetrics. 


THEOREM  It  is  inpossible  for  any 
aetric  a  applied  to  any  stage  of  the 
software  life  cycle  of  Ada  prograas  to 
be  able  to  predict  the  coaplexity  of 
the  code.  In  fact,  given  any  Ada 
software  aetric  scheae,  there  is  a 
prograa  E  for  which  each  of  the  aetrics 
in  the  scheae  has  a  fixed  value  when 
applied  t.o  the  code  but  the  aeasures 
applied  to  E  during  execution  can  grow 
arbitrarily  large. 
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Proof.  Consider  a  prograa  E  which 
involves  code  of  che  fora 
taek  type  T_TYPE  is 

type  TAsic.1  POINTER  is  access  T_TTPE; 

Any  aetrlc  applied  to  the  prograa  E 
at  any  stage  in  the  software  life  cycle 
will  esalgn  a  fixed  non-negative  nnaber 
to  E.  However ■  tasks  can  be  apawned 
dynaalcally  at  rnn-tiae.  The  nuaber  of 
casks,  and  hence  the  nnaber  of 
interconnections  between  tasks,  can  be 
made  arbitrarily  large  at  run  tiae. 

Hence  any  coaplexity  aeasure,  when 
applied  to  the  code-data  pair  during 
runclae ,  can  be  aade  arbitrarily  large 
even  though  the  value  of  the  aeasure  on 
E  before  run  tiae  was  fixed. 


This  theorea  shows  that  it  is 
iapossible  for  any  aetrlca  which  can  be 
applied  to  every  prograa  at  any  stage 
of  che  life  cycle  to  be  able  to 
precisely  predict  probleas  in 
coaplexity  of  the  code.  There  are 
several  possible  options  to  partially 
resolve  this  situation. 

1.  Continue  to  apply  these  natrics 
(or  later,  aore  polished  ‘versions)  to 
Ada  prograas  racognixing  that  these 
aetrics  cannot  give  coaplete 
inforaation  on  all  Ada  prograaa. 

2.  Apply  these  aetrics  only  to  s 
subset  of  all  possible  Ada  prograas. 
This  procedure  is  analogous  to  the 
situation  in  the  branch  of  aatheaatics 
called  aeasure  theory  where  aeasures 
are  not  applied  to  all  secs. 

3.  Partition  the  aetrics  as,  ad, 
nc  and  aa  into  a  collection  of 
functions,  not  all  of  which  can  be 
applied  to  all  prograas.  These 
functions  will  Incorporate  the  ideas  in 
section  3. 

6.  CONCLUSIONS  AND  DISCUSSION  OF 
FURTHER  RESEARCH 

This  paper  had  two  aain  theaes: 
developaenc  of  software  aetrics  for  Ada 
prograas  and  deteralning  the 
liaitatlons  of  such  aetrics.  Research 
on  the  developaent  of  aetrics  involved 
exaaination  of  several  classical 
aetrics  on  a  saaple  of  short  Ada 
prograas.  Results  obtsined  suggested 
soae  aetrics  for  Ada  prograas  that  are 
incorporated  into  Ada  aetric  scheaes. 
These  aetric  scheaes  are  quadruples  of 
aetrics  (ad.^ad,  ac,  aa)  which  are 
applied  during  the  specification, 
design,  coding,  and  aaintenance  phases 
of  che  life  cycle.  Reseach  will 


continue  into  such  aetric  scheaes  and 
their  validity  as  predictors  of 
probleas  with  progtaas  at  various 
points  in  the  software  life  cycle. 

The  second  aain  these  of  this  paper 
was  the  theoreticsl  liaitatlons  of  such 
aetrics.  We  proved  a  theorea 
indicating  that.no  aetric  applied  to 
static  code  can  predict  code  coaplexity 
for  prograaa  which  change  their 
coaplexity  at  runtiae.  The  proof  of 
the  thorea  is  based  on  a  particular  Ada 
concept.  Future  research  in  this  area 
will  concentrate  on  extending  this 
theorea  to  deteraine  which  other 
properties  of  Ada  prograas  cause 
siallar  difficulties  with  Ada  aetrics. 

Representing  aetrics  as  the  sua  of 
P-aetrics  (for  Pascal-like  prograa 
features)  and  A-aetrics  (for  prograa 
features  available  in  Ada  but  not  in 
Pascal-like  languages)  is  a  first  step 
in  this  research. 
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ABSTRACT 

A  major  feature  in  the  design  of  Ada  was 
the  high  level  support  of  concurrent  tasks.  Con¬ 
current  tasking  is  an  essential  feature  of  embed¬ 
ded  systems  in  most  environments.  In  this 
paper  we  examine  the  state  of  Ada  education  in 
the  support  of  concurrent  tasking.  The  tasking 
examples  are  compared  to  formal  models  in 
C.A.R.  Hoare's  CSP  (Communicating  Sequential 
Processes)  system.  The  resulting  information  is 
compared  with  actual  use  of  tasking  programs  in 
the  Ada  literature  and  in  industry  and  govern¬ 
ment  Particular  attention  is  paid  to  the  treat¬ 
ment  of  non-determinism  in  tasking  programs 
and  in  formal  models. 


INTRODUCTION 

In  order  to  maximize  the  effectiveness  and 
efficiency  of  a  program,  a  programmer  must 
begin  the  program  development  with  a  good 
program  design  structure.  Programs  are  made 
up  of  several  types  of  building  blocks.  Pro¬ 
grams  without  concurrent  execution  of  tasks  use 
the  standard  sequential  building  blocks  of  pro¬ 
cedures,  functions,  and  modules.  Programs 
involving  concurrent  execution  use  these  build¬ 
ing  blocks  and  the  additional  block  of  a  task 
which  is  usually  a  collection  of  the  sequential 
building  blocks.  The  use  of  concurrent  tasking 
in  programs  greatly  increases  the  potential  for 
error  in  programs  and  thus  causes  great 
difficulty  during  all  phases  of  the  software  life 
cycle.  Errors  which  occur  at  many  phases  of 
the  software  life  cycle  and  costs  which  increase 
exponentially  are  major  features  of  the 
"software  crisis".  It  is  clear  that  the  current 
"software  crisis"  will  get  even  worse  since  most 
of  the  existing  problems  have  been  with  systems 
which  do  not  involve  much  concurrent  execu¬ 
tion. 


Abstraction  and  information  hiding  are 
major  techniques  of  software  engineering  that 
are  used  to  address  some  of  the  problems  with 
software.  Indeed,  the  relative  ease  in  which 
information  hiding  and  abstraction  of  data  are 
implemented  in  the  Ada  language  is  a  major 
reason  for  the  success  of  Ada.  It  is  clear  from 
the  success  of  these  language  features  that  there 
is  a  need  for  formalism  in  the  area  of  concurrent 
programming  in  Ada. 

Perhaps  the  most  common  paradigm  for 
design  of  programs  involving  concurrent  tasks  is 
C.  A.  R.  Hoare’s  Communicating  Sequential 
Processes  (CSP).  It  is  reasonable  to  ask  if 
Hoare’s  abstract  models  of  CSP  involving  con¬ 
currency  are  applicable  and  effective  tools  in 
program  design.  Ada  was  originally  intended 
for  use  with  embedded  systems  and  concurrent 
tasking  and  to  incorporate  principles  of  good 
software  engineering;  it  is  appropriate  at  this 
point  to  examine  how  these  two  ideas  wok 
together  in  practice.  This  research  was  con¬ 
ducted  to  see  if  the  current  state  of  use  of 
abstract  models  of  Ada  programs  involving  con¬ 
current  tasking  is  sufficiently  well-understood  to 
be  used  in  providing  a  basis  for  Ada  program 
design.  Thus  this  research  represents  an  assess¬ 
ment  of  how  well  the  use  of  tasking  and  formal  . 
models  is  supported  in  the  existing  Ada  educa¬ 
tional  community. 

The  first  step  in  approaching  this  problem 
was  to  collect  data.  The  data  was  initially  col¬ 
lected  from  the  published  literature  of  Ada  pro¬ 
grams  including  textbooks,  lecture  notes,  and 
conference  proceedings.  We  chose  17  texts 
from  the  library;  the  selection  criterion  was 
actually  having  the  book  on  the  shelf  and  not  in 
circulation  at  the  time  that  the  data  was  gath¬ 
ered.  We  feel  that  this  is  a  representative  sam¬ 
ple  of  the  use  of  tasking  in  the  existing  Ada 
textbook  literature.  Programs  which  involved 
tasking  were  extracted  and  examined  to  see 


ered.  We  feel  that  this  is  a  representative  sam¬ 
ple  of  the  use  of  tasking  in  the  existing  Ada 
textbook  literature.  Programs  which  involved 
tasking  were  extracted  and  examined  to  see 
which,  if  any,  of  Hoare’s  models  could  be 
applied  to  the  programmers’  method  of  execut¬ 
ing  the  task  or  tasks.  The  results  were  tabulated 
to  see  which  models  were  applied  in  these  pro¬ 
grams. 

The  textbooks  examined  fall  into  two 
categories:  limited  amounts  of  tasking  (includ¬ 
ing  none  at  all)  and  considerable  emphasis.  A 
total  of  819  programs  from  all  textbooks  were 
examined,  with  only  114  or  13.9%  having  any 
concurrent  tasks.  We  note  that  most  of  the  pro¬ 
grams  involving  tasking  (36)  were  found  in  a 
single  reference  [7].  Of  the  programming  sam¬ 
ples  obtained  from  textbooks  in  the  first 
category,  there  were  only  32  programs  involving 
tasking  out  of  a  total  of  730  or  4.3%.  The  PIIQ 
model  of  concurrent  execution  of  tasks  with  no 
communication  between  die  tasks  was  found 
most  often,  with  a  total  of  14  instances,  of 
which  12  involved  only  two  tasks.  The  repeti¬ 
tion  of  tasks,  which  is  denoted  abstractly  as  *P, 
was  the  next  most  frequently  found,  with  seven 
instances.  The  next  most  frequent  model  occur¬ 
ring  is  the  P;Q  model  in  which  the  tasks  actu¬ 
ally  are  executed  in  order,  a  total  of  6  instances. 
Hoare  distinguishes  29  distinct  models  for  task¬ 
ing  involving  two  tasks;  only  8  of  them  or 
27.5%  are  represented  in  the  texts.  In  table  1 
below,  we  summarize  our  search  of  the  die  text¬ 
book  literature,  some  examples  of  student  pro¬ 
grams.  and  sample  programs  that  are  available 
in  the  non-textbook  Ada  literature.  Note  that 
we  show  the  number  of  tasks  in  each  example 
and  therefore  do  not  quite  agree  with  all  of 
Hoare’s  categories,  since  Hoare  only  lists  the 
possibilities  for  the  execution  of  two  concurrent 
tasks  in  his  explicit  listing  of  possibilities. 

The  textbooks  [5]  and  [7]  had  much  more 
emphasis  on  concurrent  programming  as  the 
tides  "Concurrent  Programming  in  Ada"  and 
"Parallel  Programming  in  ANSI  Standard  Ada" 
would  indicate.  There  were  a  total  of  89  pro¬ 
grams  presented  with  tasking  evident  in  42  or 
51.7%.  Here  the  range  of  programs  is  much 
wider  including  examples  of  (PIIQ)*,  $$(P  sub  1 
IIP  sub  2  ..IIP  sub  n  )*  $$,  PIIQ  with  Q  of  the 
form  R!IS;T  and  several  other  models. 

The  next  set  of  data  was  obtained  from 
student  programs.  The  intention  here  was  to 
measure  the  level  in  which  tasking  is  used  in 


such  programs.  A  preliminary  experiment 
involving  the  examination  of  12  student  pro¬ 
grams  using  concurrency  indicated  that  the 
sequential  execution  of  tasks  predominated,  with 
8  uses  of  the  P;Q  model  of  sequential  non¬ 
communicating  tasks,  2  with  parallel  execution 
of  non-communicating  tasks  (PIIQ),  one  with  the 
(*(*(*p.Q)))  mode]  0f  repeated  sequential  tasks, 
and  one  with  the  P;*Q  model  of  task  followed 
by  repetition  of  a  sequential  task.  Some  of  the 
programs  obtained  from  students  followed  the 
P;Q  model  which  describes  the  execution  of  two 
processes  or  tasks  which  are  executed  sequen¬ 
tially.  Some  observations  about  the  difficulties 
encountered  by  students  in  the  development  and 
execution  of  these  programs  was  made  in  [14], 

An  additional  data  set  was  obtained  from 
the  existing  published  non-textbook  literature. 
This  data  was  obtained  from  the  newsletter 
AdaLe tiers  (including  its  predecessor),  proceed¬ 
ings  of  several  Ada  conferences,  the  Journal  of 
Pascal,  Ada,  and  Modula-2,  materials  from  a 
variety  of  Ada  short  courses,  and  the  Ada  Repo¬ 
sitory.  Again  in  this  case,  few  of  the  sample 
programs  supported  the  more  complex  models. 

The  most  common  example  of  Ada  task¬ 
ing  programs  was  the  consumer-producer  prob¬ 
lem  which  was  presented  in  various  forms. 
Many  texts,  especially  [7],  gave  several  different 
solutions  to  this  problem.  In  some  instances, 
there  were  two  relatively  different  coding  solu¬ 
tions  to  the  same  abstract  model,  even  though 
the  two  models  appeared  to  have  the  same  CSP 
representation.  We  intend  to  pursue  this  subject 
in  future  work. 

The  remaining  data  was  collected  from  a 
small  set  of  programs  actually  used  in  industry 
and  government  Some  of  these  programs  make 
elaborate  and  extensive  use  of  tasking  while  of 
course  others  do  not  The  data  collected  is 
incomplete  at  this  point  because  of  the  difficulty 
in  obtaining  samples  of  actual  proprietary  code. 
We  do  not  expect  that  this  data  will  ever  be 
complete  or  that  it  will  represent  the  precise 
percentages  of  use  of  Ada  tasking  in  Ada  pro¬ 
grams.  Instead,  we  consider  it  as  an  example  of 
how  Ada  tasking  paradigms  are  used  in  a  few 
hopefully  representative  Ada  applications. 
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TABLE  1:  READILY  AVAILABLE  INFORMATION  ON  TASKING.  NOTE  THAT  SOME  OF 
THE  DATA  COULD  ALSO  BE  CONSIDERED  AS  MORE  COMPLICATED  CSP  MODELS 


■ 


SUMMARY  AND  CONCLUSION  gramming  must  be  increased.  This 

It  is  clear  that  the  quality  of  information  instruction  should  be  done  over  a  variety 

available  to  beginning  and  intermediate  Ada  courses  50  ^at  students  see  these  ideas 

programmers  and  designers  about  tasking  is  a  num'5er  °f  contexts, 

quite  limited  and  does  not  address  the  full  range  2.  Textbooks  in  the  language  Ada  must 

of  potential  tasking  uses.  The  actual  problem  is  include  a  wider  variety  of  tasking  pro- 

much  worse  than  this  because  Hoare’s  CSP  grams  including  more  of  Hoare’s  CSP 

models  do  not  allow  for  time  constraints  such  as  models.  While  the  amount  of  tasking 

delays  and  fixed  waits.  Such  factors  are  criti-  information  need  not  be  as  much  as  in  [7], 

cally  important  in  situations  such  as  the  FAA  it  must  be  increased  in  order  to  make 

control  system  or  indeed  in  any  system  that  sophisticated  knowledge  of  Ada  tasking 

must  perform  in  real  time.  available  to  as  many  students  as  possible. 


It  is  well-known  that  even  experienced 
programmers  have  considerable  difficulty  in 
writing  programs  which  involve  any  degree  of 
concurrency.  We  recommend  the  following 
solutions. 

1.  At  the  preliminary  level  of  education; 
that  is,  in  the  undergraduate  and  graduate 
programs  of  colleges  and  universities,  the 
amount  of  instruction  in  concurrent  pro- 


3.  Continuing  education  for  the  profes¬ 
sional  should  include  a  comprehensive 
study  of  tasking  in  Ada.  This  is  not 
appropriate  for  the  first  introduction, 
which  should  be  limited  to  the  fundamen¬ 
tal  features  of  the  language  and  Ada 
software  engineering  with  only  a  brief 
introduction  to  tasking.  Second  courses 
should  give  views  of  many  abstract 
models  of  tasking  by  means  of  many 
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different  examples.  We  note  that  this  is 
being  done  at  Pennsylvania  State  Univer¬ 
sity  (Capitol  Campus)  and  at  Computer 
Science  Corporation  (Moorestown). 

4.  In  the  absence  of  high  quality  educa¬ 
tional  opportunities  or  having  existing  per¬ 
sonnel  already  well  trained  in  Ada  task¬ 
ing,  management  must  choose  between 
using  special  expertise  from  outside  the 
organization  and  restricting  the  tasking  to 
the  simple  models  supported  by  most  of 
the  existing  texts. 
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ABSTRACT 

We  consider  several  sequential  and  parallel  algorithms  for  the  evaluation  of 
polynomials  of  low  degree,  with  particular  emphasis  on  those  that  are  used  frequently 
in  computer  graphics.  A  complete  accounting  of  computation  times  for  the  speed-up 
and  efficiency  of  these  algorithms  is  reported.  The  results  are  compared  to  standard 
primates  of  these  quantities  for  single  and  multi-processors  using  classical  complexity 
theory.  A  simulator  which  is  configurable  to  several  parallel  architectures  is  used  to 
provide  validation  of  the  results  obtained. 

1.  INTRODUCTION 

It  has  been  clear  for  several  years  that  major  improvements  in  execution  time  for  many  programs 
will  require  extensive  use  of  parallel  processing.  Many  papers  have  been  written  exploring  the  compu¬ 
tational  complexity  of  algorithms  which  are  developed  for  parallel  computation.  The  complexity  is  usu¬ 
ally  measured  on  some  abstract  machine  which  has  certain  properties  that  are  assumed  to  be  somewhat 
realistic.  In  general,  these  theoretical  results  do  not  have  a  particularly  good  correlation  with  observed 
execution  times  on  actual  hardware  realizations  of  these  abstract  parallel  machines.  The  paper  [4]  is  a 
typical  example.  Typically  the  quality  of  a  parallel  algorithm  is  measured  by  two  quantities  called  the 
efficiency  and  the  speed-up.  Speed-up  is  defined  by  the  formula 
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s(p)  =  T(iyr(p) 

while  efficiency  is  defined  by 

E(p)  =  S(p)/p. 

where  p  denotes  the  number  of  processors.  These  measures  are  well-defined  for  any  given  algorithm 
provided  that  all  of  the  times  involved  in  the  computation  are  taken  into  account  Typically,  the  "best" 
sequential  algorithm  is  used  to  compute  T(l).  The  articles  [2],  [5],  and  [9]  present  various  views  of 
what  the  actual  speed-up  of  an  algorithm  is.  In  [9],  Parkinson  claimed  that  a  particular  parallel  algo¬ 
rithm  for  adding  two  vectors  has  efficiency  greater  than  1.  This  analysis  was  disputed  by  [2]  and  [5], 
where  the  authors  indicated  that  certain  implicit  assumptions  were  made  by  Parkinson.  They  described 
other  factors  involving  actual  performance  on  any  hardware  realization  of  an  abstractly  described  paral¬ 
lel  computer. 

The  controversy  over  this  simple  algorithm  suggests  that  some  of  the  classical  results  of  arith¬ 
metic  complexity  theory  be  reviewed  from  the  point  of  view  of  the  actual  times  needed  for  performance 
of  needed  operations  in  an  arithmetic  computation.  In  this  paper  we  are  concerned  with  the  evaluation 
of  polynomials.  We  will  consider  a  number  of  algorithms  for  evaluation  of  low  degree  polynomials  and 
obtain  estimates  of  run  time  speed  using  techniques  of  classical  arithmetic  complexity  theory.  The 
actual  numbers  of  memory  accesses,  register  moves,  index  changes,  arithmetic  operations  ,  and  inter¬ 
process  communications  will  be  given  and  will  be  translated  to  actual  efficiency  and  speed-up  for  a 
number  of  parallel  architectures. 

We  restrict  our  attention  to  low  degree  polynomials  in  this  paper  for  several  reasons.  First,  the 
actual  time  costs  of  all  of  the  operations  are  apparent  in  low  degree  polynomial  evaluation.  Second,  our 
intention  was  to  compare  actual  to  theoretical  results.  It  is  easy  to  do  this  for  low  degree  polynomials. 
Finally,  real-time  graphics,  which  is  one  of  the  most  computationally  intensive  fields  and  is  a  typical 
target  for  parallel  computation,  is  concerned  almost  exclusively  with  the  evaluation  of  cubic  polynomi¬ 
als  or  quotients  of  cubic  polynomials  when  solid  objects  are  displayed.  For  additional  information,  see 
the  reference  [3]  from  which  the  following  discussion  of  the  use  of  low  degree  polynomials  in  computer 
graphics  is  taken. 


Low  degree  polynomials  appear  naturally  in  computer  graphics  in  the  following  context  Suppose 
that  the  graph  of  some  surface  is  to  be  displayed.  The  surface  is  approximated  by  a  collection  of 
patches  which  are  defined  by  a  collection  of  quadruples  of  points.  These  quadruples  may  represent 
points  on  the  surfaces  or  some  combination  of  points  on  the  surface  or  points  of  tangency,  or  points  that 
are  chosen  to  represent  the  smoothness  of  the  surface  without  lying  precisely  on  the  surface.  The  patch 
is  given  in  parametric  form  x  =  x(u,v),  y  =  y(u,v),  z  =  z(u,v),  where  u  and  v  are  restricted  to  lie  in 
some  region  which  is  almost  always  the  unit  square  0  £  u  ,  v  £  1.  The  functions  x(u ,v),  y(u,v),  and 
z(u,v)  are  either  products  of  polynomials  of  degree  at  most  3  or  are  quotients  of  such  products.  The 
reason  for  this  is  technical  and  involves  having  exactly  the  minimum  number  of  degrees  of  freedom  to 
allow  for  smoothness  when  two  patches  are  joined. 

The  graph  of  the  surface  is  approximated  near  the  patch  by  the  following  algorithm. 

FOR  each  patch  DO 

/*  evaluate  the  patch  for  curves  of  constant  u  */ 
for  (u  =  0;  u  £  1;  u  -  u  +  increment) 
move_abs_  3(x  (u  ,0),  y  (« ,0),  z  (« ,0)); 
for  (v  =  increment ,  v  £  1,  v  =  v  +  increment ) 
line_abs_  3(x  (« ,v  ),  y  (u  ,v ),  z  (« ,v  )); 

f*  evaluate  the  patch  for  curves  of  constant  v  */ 
for  (v  =  0;  v  £  1;  v  =  v  +  increment ) 
move_abs_  3(x  (0,v ),  y  (0,v ),  z  (0,v )); 
for  (u  =  increment ,  u  £  1,  u  =  u  +  increment) 
line_abs_  3(x  (u  ,v  ),  y  (u  ,v  ),  z  (u  ,v  )); 

Each  patch  requires  2 (increment)'2  evaluations  of  the  functions  x(u,v),  y(u,v),  z(u,v)  for  a  total 
of  6(increment)~ 2  evaluations  of  functions.  A  typical  realistic  picture  in  computer  graphics  may  contain 
between  500  and  2000  or  more  patches  and  may  require  values  of  increment  of  .001.  This  corresponds 
to  1.2*  1012  possible  function  evaluations  of  the  three  component  functions  x ,  y  ,  and  z.  Clearly  a  crit¬ 
ical  factor  is  the  speed  in  which  the  functions  can  be  evaluated.  More  importantly,  the  large  number  of 
computations  needed  to  solve  this  problem  strongly  suggests  a  need  for  some  of  the  computations  to  be 
performed  in  parallel . 
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2.  SOME  STANDARD  RESULTS  FROM  COMPLEXITY  THEORY 


We  recall  a  few  results  from  classical  arithmetic  complexity.  Let  n  be  a  positive  integer  and  con¬ 
sider  the  polynomial 

p(x)  =  a0  +  a,x  +  ..■a.x*  . 

Suppose  that  we  wish  to  evaluate  this  polynomial  on  a  single  processor.  To  evaluate  p(x)  for  a  given  x 
requires  the  computation  of  x"  and  all  lower  powers  of  x.  The  most  obvious  algorithm  requires  2n  -  1 
multiplications  and  n  additions.  A  faster  procedure  is  Homer’s  method  where  we  write 


p(x)  =  (...((a„x  +a,_i)x  +  a._2>x...)  +  a„  . 

This  requires  n  multiplications  and  n  additions.  Knuth  [6]  gives  an  algorithm  due  to  Belaga  which 


requires 


+  2  multiplications  and  n  additions.  See  [7]  and  [10]  for  other  results  in  this  area. 


For  parallel  evaluation  of  polynomials,  some  major  results  are  due  to  [7]  ,  [8],  and  [10].  These 
results  suffer  from  some  of  the  same  difficulties  as  the  previously  mentioned  work  on  the  efficiency  and 
speed-up.  See  the  paper  [1]  for  an  analysis  of  some  of  the  difficulties  involved. 


3.  PARALLEL  ALGORITHMS  USED  FOR  POLYNOMIALS  IN  A  SINGLE  VARIABLE 

J 

In  this  section,  we  describe  algorithms  (both  sequential  and  parallel)  for  evaluation  of  low  degree 
polynomials  in  one  variable.  In  order  to  conserve  space,  we  have  omitted  each  case  of  Homer’s  algo¬ 
rithm  for  single  processors  and  have  followed  the  notational  conventions  of  using  the  terminology  of 
the  original  papers  with  "pre-processing'’  and  not  explicitly  writing  obvious  communications  between 
processors. 

A22:  (Homer’s  method,  quadratic  polynomial,  two  processors) 

PROCESSOR  1  PROCESSOR  2 
aye  a,x 

aye2  a^  +  a0 

aye 2  +  a\x  +  a0 


for  second  degree  polynomials, 
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A3.1a:  (Knuth’s  method,  cubic  polynomial,  single  processor) 

y  =  x  +  c 
w  =  y2 

>2=  >V  - 

z  =  flay 
2  =2  +po 
^2 


A3 .2:  (Homer’s  method,  cubic  polynomial,  two  processors) 

PROCESSOR  1  PROCESSOR  2 

x2  x 2 

a&2  ajx2 

aix2  +  a0  ojr2  +  fli 

(a3x2  +  a1)x 

fl2X2  +  aoHaaX2  + 


A3 .2a:  (Knuth’s  method,  cubic  polynomial,  two  processors) 


PROCESSOR  1  PROCESSOR  2 
y  =x  +  c  y  =  x  +  c 

y2  ajy 

y2-(*i  z  =  a#  +  Pq 

z(y2~  Oi) 


for  third  degree  polynomials  and 


A4.1a:  (Knuth’s  method,  quartic  polynomial,  single  processor) 

y  -x  +  c 
2 

w  =  y 
w  -  Oj 
a+y 
+  Oo 

(a*y  +  a o)y 
z  =  (o<y  +  ao)y  +  Po 
z(w  -  at) 
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A42:  (Homer’s  method,  quartic  polynomial,  two  processors) 


PROCESSOR  1 
x2 

a#2 

a4X2  +  02 
( a 4X2  +  a^x2 

(a#2  +  a£x2  +  a0 

(04X2  +  aj)x2  +  a0  +  (o^x2  +  fli)x 


PROCESSOR  2 

x2 

flax2 

fljX2  +  a\ 

(fljX2  +  U])x 


A4.2a:  (Knuth’s  method,  quartic  polynomial,  two  processors) 
PROCESSOR  1  PROCESSOR  2 

X  -KXo  X  +  02 

z  =  (x+Oo)x  delay 

y  =  z  +  at  delay 

x  +(X2  +  y 
(x  +  02  +  y)y 
(x  +  ct2  +  y)y  +  a3 
((x+y+a2)y  +  a3)a» 


A4.2b:  (Knuth’s  method  with  modifications,  quartic  polynomial,  two  processors) 

PROCESSOR  1  PROCESSOR  2 
y=x+c  y  =  x  +  c 

y 2  fl<y 

y2-<*i  a#  +  Oo 

delay  (a<y  +  a o)y 

2  =  (a<y  +  ao)y  +  Po 
x(y2-ai) 

for  fourth  decree  polynomials. 

These  algorithms  were  implemented  on  a  variety  of  distributed  systems  with  processing  elements 
of  several  different  architectures  .  The  results  are  given  in  the  next  section. 

In  each  case,  the  processors  used  in  the  distributed  system  were  identical.  The  experiment  was 
repeated  for  each  algorithm  so  that  some  independence  of  processors  was  obtained.  Assembly  code  was 
not  optimized  by  any  unusual  tricks  but  was  given  with  the  intention  of  imitating  typical  code  generated 
by  a  compiler.  Thus  the  results  should  be  typical  of  the  situation  actually  encountered  in  practice,  espe¬ 
cially  since  the  general  problem  of  efficiently  using  registers  for  computation  on  an  arbitrary  computer 
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is  extremely  difficult. 

Suppose  we  define  a  processing  element  as  a  triple  (CPU,  memory,  single  data  channel-in,  single 
data  channel-out).  The  results  of  this  paper  are  described  in  the  following  theorem.  We  will  need  to 
use  the  following  notation: 

-  time  for  addition/subtraction, 
rw.  =  time  for  multiplication/division, 
ti-i~  =  time  for  array  index  access, 
t„,jm  =  time  for  memory  access, 
thcr  =  time  for  loop  control 

tcemm  =  ^  for  communication  between  two  adjacent  processors. 

Theorem  1  :  Let  F  be  an  arbitrary  polynomial  of  degree  2,  3,  or  4  in  a  single  real  variable  u .  Suppose 

also  that  F  has  real  coefficients.  Let  E(p,n)  and  S(pji)  be  the  efficiency  and  speed-up  for  the  problem 

of  evaluating  F(u)  at  a  real  variable  u  on  a  p-processor  non-pipelined  system  for  a  polynomial  of 
degree  n .  Then 

5(2,2)  S  (2/ttU  +  It  mult  +  fifiMm  +  2JukUz  )  I  Uadd  +  +  timdax  +  *c omm) 

£(2,2)  =  S(2,2)/2 

5  (2,3)  5  ( ,3tadd  +  3 tnut/i  +  7tMm  +  3fjn^ny(2ra(u  ^malt  +  “^utdax  l comm ) 

5(23)  S  ("it add  +  'it  mull  "*■  ItuHm  +  ^tiadti  V (2todd  +  ^rntdl  +  +  2l«a«x  +  ^comm ) 

£(2,3)  =  S(23V2 

5  (2,4)  £  +  4tmijl  +  9rmmi  +  4fufc^r  )/(3:a^  +  3(^1  +  fit  mm  +  2r^(X  +  tcomm ) 

£(2,4)  =  S  (2,4)/2 


Proof. 


Algorithms  A2.1,  A3.1,  and  A4.1  represent  Homer’s  method  on  one  processor.  Note  also  that  we 
can  reduce  the  number  of  index  accesses  by  one  if  we  note  that  o0  is  typically  in  the  first  memory  cell 
devoted  to  the  array  of  coefficients.  We  have  the  following  table  which  describes  the  total  time  needed 
for  evaluation  of  the  polynomial  £ . 
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ALGORITHM 

A2.1 

A2  2 

A3.1 

A3.1a 

A3  2 

A3  2a 

A4.1 

A4.1a 

A42 

A42a 

A42b 


TOTAL  TIME 

2 +  2/^  +  SIm,  +  2/**, 

'odd  +  ^timib  +  3fMCTI  +  'udo  +  'comm 

3t-jj  +  3  tmt&  +  7tmrm  +  3  tf-rf— 

3  +  3tnhj.  +  ltnrm  +  3f..  j.. 

2/«*j  +  3/«d,  +  4t _ +  2/.-^.  + 

2*««  +  21^  +  St***  +  2/ia^x  +  'com. 

4(«du  +  4(mm^  +  Qt-nf-  +  4f.— i— 

+  4rMaA  +  9tmr.  +  4 

3'a^d  +  3t _ i.  +  6tmtm  +  2/..J—  +  >L1M„|, 

4t^<  +  3tnu/f  +  9tmem  +  4f,nrfrr  +  >-||,n, 

3/— aj  +  3f— ji  4-  8/n,,-,  +  +  teomM 


Examination  of  the  table  completes  the  proof  of  the  theorem. 
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Remarks: 

1.  These  bounds  on  the  efficiency  and  speed-up  are  not  increased  if  we  allow  each  of  the  proces¬ 
sors  to  have  registers  with  faster  access  time  than  normal  memory  access. 

2.  The  performance  algorithms  can  be  speeded  up  if  we  feed  in  data  via  a  pipeline.  This  situation 
was  not  described  by  the  theorem  since  it  violates  the  single  data  channel-in  and  single  data 
channel-out  requirement  In  particular,  our  results  do  not  apply  to  algorithms  such  as 
deCastlejau’s  method  pictured  below  for  computing  cubics.  Note  that  the  cubics  produced  are  not 
in  standard  form. 


In  deCastlejau’s  method,  the  values  of  Pu  P2,  and  P  3  are  given  to  the  processing  elements  in 
the  bottom  row.  These  values  are  multiplied  by  x  and  sent  to  the  processor  pictured  above  at  the 
same  time  that  they  are  multiplied  by  1  -  x  and  sent  to  the  processor  at  right  This  simultaneity 
is  due  to  the  pipelined  architecture  and  the  special  design  of  the  processing  elements.  In  die  next 
step,  this  is  repeated  in  the  next  row  of  processing  elements  P4,  P5,  P6.  At  the  same  time,  the 
processors  in  the  bottom  row  are  sent  new  values  of  x  to  continue  the  process.  A  similar  thing 
happens  when  data  flows  up  from  the  first  row  to  the  second  row  and  from  the  second  row  to  the 
top  row.  Let  us  ignore  the  time  r.-^_  for  indexing  the  four  values  that  are  given  to  the  processing 

i 


elements  in  the  bottom  row.  The  steady  state  speed-up  and  efficiency  of  this  algorithm  are  each 
much  greater  than  that  of  the  obvious  sequential  analog  because  of  the  tremendous  decrease  in 
memory  access  and  pipelining.  In  fact,  for  a  single  value  we  have  a  parallel  time  of 
Stmem  +  3/0^  +  +  3/*«  as  opposed  to  42in„  +  12 +  12 for  the  sequential  method. 

Evaluation  of  N  points  using  this  pipeline  involves  (N  +  +  t^)  as  opposed  to  the 

sequential  time  which  is  42Ntmm  +  12 Nt-.,.  +  12 Nt^.  In  this  case,  the  speed-up  is  at  least 
12N/(N+2)  and  the  efficiency  is  at  least  1  for  N  larger  than  10.  This  provides  another  example  of 
the  well-known  fact  that  efficiencies  greater  than  one  are  easy  to  obtain  if  we  consider  poor 
sequential  algorithms. 

3.  Note  that  the  trivial  evaluation  of  a  polynomial  of  degree  one  takes  only  two  arithmetic  steps 
(one  multiplication  and  one  addition),  3  memory  accesses  and  one  index  time  and  thus  this 
evaluation  can  be  done  fastest  on  a  single  processor. 

4.  PARALLEL  ALGORITHMS  FOR  POLYNOMIALS  IN  TWO  VARIABLES 

In  this  section  we  will  consider  three  types  of  algorithms:  parallelized  versions  of  the  algorithms 
presented  in  the  previous  section,  parallelized  versions  of  matrix  algorithms,  and  pipelined  algorithms 
based  either  on  the  deCastlejau  method  or  on  the  method  of  finite  differences. 

The  evaluation  of  polynomials  in  two  variables  can  be  described  in  terms  of  the  evaluation  of 
polynomials  in  a  single  variable.  A  simple  way  to  do  this  is  to  recognize  that  a  polynomial  F  in  two 
variables  u  and  v  can  be  considered  as  a  polynomial  P  in  one  variable  v  whose  coefficients  are  poly¬ 
nomials  Qi  in  the  variable  u.  Using  this  idea,  the  time  for  sequential  evaluation  of  a  polynomial  F 
where  u  and  v  appear  to  at  most  the  third  power  is  four  times  the  time  for  the  evaluation  of  a  polyno¬ 
mial  in  a  single  variable  u  (since  there  are  four  polynomials  as  coefficients)  plus  the  time  for  evaluation 
of  a  polynomial  in  v  whose  coefficients  are  the  Q, .  This  is  5  times  the  values  of  the  time  given  in  the 
previous  section.  Using  Homer’s  method  as  a  starting  point,  the  time  for  evaluation  is 

‘  mull  +  ^mdtx 

where  the  additional  terms  are  obtained  from  accessing  the  Qi . 
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A  matrix  method  is  frequently  used  for  sequential  evaluation  of  polynomials  in  two  variables  on  a 
single  processor.  This  method  involves  the  quadratic  form  UMVT  where  U  and  V  represent  the  row 
vectors  1,  u,  u2,  u3  and  1,  v,  v2,  v3,  M  represents  the  matrix  of  coefficients,  and  the  superscript  T  indi¬ 
cates  the  transpose.  This  involves  4  multiplications  for  the  powers  of  the  variables,  20  multiplications 
involving  coefficients,  15  additions,  55  memory  accesses,  and  40  index  accesses.  The  total  can  be 
reduced  by  removing  redundant  multiplications  by  1  to 

15  tgjj  +  19  tmilh  +  45tHOT  +  40t._i_ ,  (4.1) 

which  is  slightly  slower  than  the  first  sequential  example. 

The  computations  in  this  algorithm  can  obviously  be  speeded  up  by  performing  some  of  them  in 
parallel.  For  example,  each  of  the  4  multiplications  of  a  row  and  a  column  vector  can  be  done  in  paral¬ 
lel  with  the  last  results  being  communicated  to  a  single  processor.  Using  4  processors,  the  time  is 
reduced  to 


and  the  speedup  is 


+  18tmtw  +  16 +  4rcomm . 


(4  2) 


15f«u  +  1 9u  +  45^  +  40^ 


6/^  +  +  18t _ +  16r, 


+  4/, 


In  the  remainder  of  this  section,  we  will  consider  the  extent  to  which  the  times  in  equation  (4.2)  can  be 
improved  if  some  of  the  computations  are  done  in  parallel. 

We  will  use  the  term  bicubic  polynomial  to  denote  a  polynomial  in  two  variables  with  real 
coefficients  such  that  each  variable  appears  to  at  most  the  third  power. 

Theorem  2  :  Let  P{u,v)  be  a  bicubic  polynomial  in  the  real  variables  u  and  v.  The  speedup  in  the 
time  to  evaluate  P(u  ',v)  using  two  processors  and  no  matrix  methods  is  at  most 

IStgjj  +  15^,  +  43  Lioti  +  19  tiHAsx 
9 tadd  9t null  +  '2Atmtm  +  lOtuwiM  +  tcomm 

and  the  efficiency  is  at  most  19/10.  Using  four  processors,  the  speedup  is  at  most 

15  lodi  +  15^,  +  43tw<m  +  19t^ 

6  tadd  +  +  15/«*  +  lOt^  +  4  tc 

and  the  efficiency  is  at  most  43/15. 


(4.4) 
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Proof. 


In  the  algorithms  given  for  evaluation  of  a  polynomial  in  a  single  variable  presented  in  the  previ¬ 
ous  section,  all  of  the  coefficients  were  assumed  to  have  been  loaded  into  the  memory  of  the  appropri¬ 
ate  processor.  In  addition,  the  time  for  the  pre-processing  of  the  original  coefficients  in  Knuth’s  algo¬ 
rithms  (A3,  la,  A3 .2a,  A4.2a,  and  A4.2b)  was  ignored  since  it  was  presumed  to  have  been  small  com¬ 
pared  to  the  large  amount  of  time  needed  for  evaluation  of  the  polynomial  at  many  points.  Neither  of 
these  assumptions  may  be  used  directly  in  the  case  of  polynomials  in  many  variables  since  the  depen¬ 
dence  of  coefficients  of  a  polynomial  pre-processing  and  loading  of  coefficients  has  not  generally  been 
done  prior  to  the  evaluation  of  the  polynomial. 

For  two  processors,  there  are  two  basic  organizations  possible:  use  a  parallel  algorithm  for  each 
evaluation  or  use  sequential  algorithms  for  the  polynomials  in  one  variable,  using  each  processor  in  a 
sequential  manner,  with  communication  between  processors  at  the  end.  Using  the  parallel  Homer’s 
algorithm  A3 .2  five  times,  we  have  a  time  of 

(4-5) 

to  which  we  must  add  the  time  for  extra  memory  accesses  due  to  the  various  coefficients  of  the  already 
computed  polynomial  in  one  variable  not  being  in  the  memory  of  the  correct  processor  for  the  computa¬ 
tion  of  the  polynomial  in  two  variables  as  well  as  time  for  communication.  Using  the  most  efficient 
choices  causes  the  total  obtained  by  using  A3.2  five  times  to  change  from  equation  (4.5)  to 

10r«u  +  15  Imuh  +  28  tmm  +  14/jaja  +  ^  tecum-  (4.6) 

A  similar  design  using  a  parallel  Knuth’s  algorithm  A3.2a  five  times  gives  a  total  time  of 

5(2 1.U  +  2^  +  5tmtm  +  2t^  +  rMmm)  +  8r«*  +  4:**,  +  r,  (4.7) 

where  t  represents  the  time  for  "pre-processing"  the  polynomials  in  a  single  variable  to  become 

coefficients  of  a  polynomial  in  two  variables.  This  last  step  is  necessary  since  the  output  of  Knuth’s 

method  is  a  polynomial  in  standard,  not  pre-processed  form.  This  pre-processing  takes  many  additions 

and  multiplications  and  cannot  be  ignored  since  it  is  done  whenever  we  evaluate  the  polynomial  in  two 

variables.  Thus  this  algorithm  does  not  easily  parallelize. 


The  other  method  that  we  consider  involves  using  sequential  methods  as  long  as  possible.  Here 


we  use  processor  1  to  compute  the  polynomials  2o  and  Q 2  with  processor  2  computing  Q ,  and  Q 3  fol¬ 
lowed  by  a  parallel  evaluation  of  the  polynomial  in  two  variables.  The  total  time  is 


2(3 taU  "*■  3t, mull  +  +  3 f«d*j)  +  + 

3r*«  +  3/,^  +  Atntn  +  "hutdtx  +  8  tmtm  +  4/,*kx  +  ‘comm 

for  a  total  of 

9  tmu  +  9u  +  24  ^  +  lOr^  +  u  (4.8) 

which  is  considerably  smaller  than  either  (4.6)  or  (4.7).  This  proves  (4.3).  To  prove  (4.4),  we  simply 

apply  the  same  reasoning  to  the  case  of  four  processors  and  keep  track  of  added  communication, 

memory  and  index  times  to  obtain  the  desired  result.  This  completes  the  proof  of  the  theorem. 


Theorem  3  :  Let  P(u,v)  be  a  bicubic  polynomial  in  the  real  variables  u  and  v  and  suppose  that  P  is 
to  be  evaluated  at  N  points  without  using  matrix  methods.  Then  the  speedup  in  the  evaluation  time  is 
asymptotically  the  same  as  in  theorem  2  for  the  case  of  two  processors  and 


_ 4^(15  u,  +  19rw,  +  45u„  +  40t^  +  St^) _ 

(5 N  +  8)((3r«a  +  3r„*  +  +  3^)  +  8f^  +  4^  +  4 Nt^)  ' 

for  four  processors.  If  five  processors  are  available,  the  speedup  improves  to 


(4.9) 


N(15r«u  +  19  tooj,  +  45 1, 

(N  +  l)(3taji  +  3 limit  7 ‘mem  +  3?^, 


tmm  +  *0^  +  St, 


Proof. 


•  ■ 

For  a  two  processor  system,  one  of  the  processing  elements  will  be  idle  much  of  the  time  in  the 
final  computation  of  the  bicubic.  This  is  still  the  case  if  there  are  many  points  at  which  the  bicubic  is 
to  be  evaluated  and  hence  the  speedup  is  essentially  limited  by  the  factors  slowing  down  the  evaluation 
at  a  single  point.  The  maximum  speedup  is  dependent  on  careful  programming  to  minimize  idle  time; 
in  any  event  it  is  similar  to  the  results  previously  given. 


Note  that  the  effect  of  has  been  changed  in  the  numerator  of  (4.9)  and  (4.10).  This  was  done 
to  account  for  the  new  loading  of  the  values  of  the  coefficients  into  memory  at  initialization;  this  was 
not  considered  previously.  To  evaluate  a  bicubic  polynomial  at  N  points  using  four  processors,  we 
must  also  have  some  amount  of  idle  processor  time.  The  results  from  each  of  the  single  variable  evalua¬ 
tions  must  be  made  available  to  the  processor  performing  the  evaluation  in  the  second  variable.  The 
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evaluations  being  performed  are  given  below. 

PROCESSOR  1  PROCESSOR  2  PROCESSOR  3  PROCESSOR  4 
1111 
1  -2  2  2 

2  3  3  3 

2  3  4  4 

3  4  4  5 

4  5  5  5 

5  6  6  6 

The  state  of  available  processors  is  precisely  the  same  in  this  last  state  as  it  was  in  the  second 
state  where  evaluation  2  was  being  done  on  processors  2,  3,  and  4.  Thus  the  time  for  evaluation  of  N 
points  is  the  sum  of  an  initialization  time  plus  a  time  for  getting  to  the  same  state  plus  a  time  for  termi¬ 
nation.  The  total  is  dependent  on  the  value  of  N  modulo  4;  in  the  simplest  case  it  is 

(5 N  +  8)((3 tgjj  +  )  +  8tMm  +  +  4Ntcomm  )/4  , 

Considerable  improvement  is  possible  if  we  have  five  processors  available.  In  this  case,  the 
evaluations  of  the  polynomials  in  a  single  variable  are  done  on  four  processors  and  the  fifth  processes'  is 
devoted  to  evaluation  of  the  bicubic  after  data  is  transmitted  to  it  The  total  time  for  evaluation  of  a 
bicubic  at  N  points  is  thus  approximately  N  times  the  time  for  evaluation  of  a  single  variable  polyno¬ 
mial  with  an  additional  time  added  for  evaluation  of  the  last  bicubic  on  the  fifth  processor.  The  actual 
total  is 

(N  +  1)(3 tgdd  +  h mult  +  7tm*m  +  +  N  (St mem  +  4 tiudtx  +  4 tcomm). 

This  completes  the  proof  of  the  theorem. 
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The  next  method  we  consider  is  that  of  using  pipelined  architectures.  Modifying  the  method  of 
deCastlejau  leads  to  a  pyramidal  architecture  scheme  in  which  there  are  16  nodes  at  lowest  level, 
grouped  into  4  collections  of  four  nodes  each.  Each  of  these  collections  computes  a  different  polyno¬ 
mial  in  one  of  the  variables  and  sends  its  output  to  another  collection  of  nodes  using  the  same  pipelined 
design  as  considered  in  the  previous  section.  This  last  collection  of  nodes  takes  the  values  of  the 
second  independent  variable  and  continues  as  in  the  deCastlejau  model  in  the  previous  section  to  com¬ 
pute  the  values  of  the  polynomial  in  terms  of  the  second  independent  variable  and  thus  produces  die 
output  of  a  polynomial  of  appropriate  degree  in  both  variables.  As  before,  the  polynomial  is  not  in 
standard  form  but  instead  appears  in  a  form  conducive  to  numerical  computation.  The  next  result  is 
easily  obtained  using  the  same  reasoning  as  in  the  analysis  of  deCastlejau’s  method  in  the  previous  sec¬ 
tion;  the  proof  is  omitted.  As  before,  efficiency  larger  than  1  is  possible  for  this  algorithm  using  a 
pipelined  architecture. 

Theorem  4:  The  speedup  using  deCastlejau’s  method  instead  of  sequential  evaluation  to  evaluate  a 
bicubic  polynomial  at  N  points  is 

5(42 M—  +  12M„«  +  12M-,) 

(N  + 

The  efficiency  is  60N/(N+€),  which  is  greater  than  1  if  N  >  12. 


5.  DISCUSSION 

The  primary  motivation  for  the  research  in  this  paper  was  the  controversy  generated  by  the  papers 
[2],  [5],  and  [9]  and  the  observation  in  [4]  that  many  parallel  algorithms  do  not  perform  at  or  even  near 
their  theoretical  limits  on  many  actual  parallel  machines.  The  algorithms  were  implemented  in  assem¬ 
bly  language  on  a  simulator  which  can  be  configured  to  model  a  number  of  different  distributed  system 
architectures  and  CPU’s.  Implementing  these  algorithms  using  low  level  assembly  languages  emulated 
the  actual  running  of  a  computer  and  avoided  making  any  assumptions  about  the  quality  of  code  gen¬ 
erated  bya  compiler  or  assembler.  Each  load  or  store  instruction  was  counted  as  a  memory  access  and 
it  was  assumed  that  the  time  for  data  transfer  into  and  from  memory  is  the  same.  In  general,  the 
observed  execution  times  showed  good  agreement  with  the  theoretically  obtained  times  described  earlier 
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in  the  paper. 

The  optimal  number  of  processors  for  parallel  computation  of  second,  third,  or  fourth  degree 
polynomials  of  a  single  variable  at  a  single  input  value  on  non-pipelined  systems  is  2.  While  it  is  pos¬ 
sible  to  obtain  a  parallel  algorithm  for  particular  choices  of  polynomials,  communication  and  synchroni¬ 
zation  constraints  make  it  difficult  to  design  an  efficient  generalized  parallel  algorithm  which  is 
independent  of  either  underlying  connection  architecture  or  the  actual  architecture  of  the  processor  ele¬ 
ments. 

Note  also  that  the  models  of  computation  described  in  this  paper  all  assumed  that  the  processors 
shared  no  common  memory;  this'  assumption  is  common  in  hypercube  computers.  However,  shared 
memory  simply  reduces  the  time  for  inter-processor  communication  and  thus  the  value  of  the  quantity 
tcomm  will  be  smaller  than  in  a  non-shared  computer.  Of  course,  the  time  represented  by  it  will  still  be 
present  in  any  real  situation.  For  example,  the  reference  [3]  has  an  example  of  the  use  of  the  method 
of  finite  differences  to  speed  up  sequential  evaluation  of  bicubics.  Unfortunately,  it  seems  that  a  pipe¬ 
lined  method  such  as  the  method  of  finite  differences  can  only  be  used  on  a  shared  memory  multi¬ 
processor  system.  No  speedups  were  found  using  this  method  for  evaluation  of  bicubics  on  our  simula¬ 
tor  for  any  architecture. 

For  quadratic  functions,  Homer’s  method  provides  the  best  pre-processing  scheme.  Hence  the 
sequential  processing  of  a  quadratic  will  exploit  this  scheme.  On  the  other  hand,  for  n  £  3,  the  scheme 
by  Knuth  [6]  will  be  utilized.  In  this  example  also,  there  is  no  need  for  more  than  2  processors.  Even 
for  n  =  4,  the  optimal  number  of  processors  is  apparently  2.  Furthermore,  parallelizing  evaluation  of 
cubics  via  Knuth’s  pre-processing  scheme  on  a  2-processor  computer  cuts  the  multiplications  to  2,  addi¬ 
tions  to  2,  and  memory  accesses  to  5  with  1  inter-processor  communication.  Nevertheless,  the  speed-up 
obtained  is  actually  an  approximate  value  because  of  the  pre-processing  involved.  To  obtain  the  param¬ 
eters  in  the  pre-processing  scheme,  n+1  simultaneous  equations  have  to  be  solved.  The  roots  of  these 
equations  may  even  be  complex.  Therefore  if  the  same  polynomial  is  not  to  be  evaluated  at  numerous  x 
values  then  this  method  may  not  be  superior  to  Homer’s.  Given  this  fact,  our  experiment  indicates  that 
using  Knuth’s  scheme  for  low  degree  polynomials,  evaluation  can  be  implemented  on  a  2-processor 


-  17  - 


computer  but  that  the  efficiency  is  considerably  less  than  1. 

The  optimal  number  of  processors  far  evaluation  of  bicubics  at  N  points  is  a  multiple  of  5  assum¬ 
ing  that  the  communication  times  between  processors  is  constant.  If  the  number  of  processors  is  a  mul¬ 
tiple  of  5,  then  the  results  given  in  theorem  3  can  simply  be  divided  by  an  appropriate  constant 

A  final  remark  is  in  order  about  bottlenecks  in  a  system  for  display  of  the  bicubics  that  we  con¬ 
sidered  in  this  paper.  Most  computer  graphics  systems  use  either  a  raster-type  output  device  such  as  a 
CRT  screen  or  laser  printer  in  which  the  picture  is  scanned  one  line  at  a  time  or  a  vector  device  where 
a  writing  device  moves  from  one  point  to  the  next  rather  than  a  line  at  a  time.  The  most  common 
example  of  a  vector  device  is  a  plotter  although  vector  CRT  screens  are  still  available.  In  general,  all 
of  these  devices  are  sequential  and  vector  devices  in  particular  are  not  well  suited  to  parallelization. 
Perhaps  the  next  step  is  special  parallel  output  devices. 
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L  INTRODUCTION 

A  major  problem  in  solid  modeling  is  computing  a  "blending  surface"  between  two  intersecting 
surfaces.  In  a  typical  example,  the  surfaces  are  described  by  some  polynomial  equation  of  low  degree 
and  the  Mending  surface  is  intended  to  provide  a  transition  between  the  surfaces.  The  actual  form  of 
the  blending  surface  or  its  equation  are  generally  not  as  important  as  the  need  to  rapidly  compute  the 
surface  in  order  to  be  able  to  use  it  in  an  interactive  system.  For  an  example  of  a  blending  surface, 
look  at  a  standard  tekpbooe. 

One  obvious  method  of  describing  the  blending  of  two  surfaces  is  to  use  the  surface  that  is  swept 
out  by  a  rolling  ball  tangent  to  both  of  the  surfaces.  Unfortunately,  this  method  often  has  the  undesir¬ 
able  effect  of  having  blending  surfaces  which  have  much  higher  degree  than  the  surfaces  to  be  blended, 
which  are  often  quadric  Strange  geometric  effects  are  often  associated  with  this  type  of  blend,  particu¬ 
larly  when  there  are  more  than  one  surface  to  be  blended. 

Many  authors  ([2],[3].[4],[6],[8],[9]and  [10])  have  considered  the  problem  of  the  generation  of 
blending  surfaces.  Middleditch  A  Sears  [6]  describe  blending  by  a  method  which  gives  blending  sur¬ 
faces  of  degree  4  if  the  surfaces  to  be  blended  are  of  degree  2.  In  [10],  the  author  considers  using  other 
blending  surfaces  described  by  toroids  and  cylindrical  pieces  in  which  the  blending  surfaces  also  have 
low  algebraic  degree 

Hopcroft  A  Hoffman  have  taken  another  view.  In  a  collection  of  papers  ([2],  [3]  and  [4],  they 
develop  a  method  that  they  call  the  projective  method.  This  method  uses  some  techniques  from  alge¬ 
braic  geometry  to  obtain  a  surface  which  blends  two  surfaces  by  choosing  curves  on  each  and  then 
obtaining  a  surface  which  is  tangent  to  the  surfaces  at  the  given  curves.  In  [3],  they  show  that  if  the 
curves  are  quadratic  curves  and  the  surfaces  are  quadric,  then  the  blending  surfaces  are  of  degree  four. 

All  of  these  methods  lead  to  str'aces  which  blend  the  existing  surfaces.  In  many  but  not  all 
applications,  they  provide  satisfactory  results  from  an  aesthetic  point  of  view.  We  note  that  there  are 
typically  two  concerns  for  blending  surfaces  in  these  papers:  rapid  computation  and  description  of  the 
surfaces  by  simple  equations  erf  relatively  low  degree.  The  equations  of  low  degree  are  desirable  since 
blending  is  o tv  followed  by  shading  and  by  ray  tracing.  The  skiing  and  ray  tracing  algorithms  are 
far  simpler  and  master  than  if  the  degree  of  the  surface  is  smalL  The  only  geometric  constraint  ever 
used  is  that  of  tangency  to  the  surfaces,  often  in  prescribed  curves. 

In  this  note,  we  consider  the  generation  of  blending  surfaces  which  attempt  to  minimize  the  sur¬ 
face  area  of  the  blend.  Our  technique  will  involve  a  mathematical  theory  known  as  the  calculus  of 
variations.  We  present  an  analysis  of  the  difficulties  involved  with  this  technique  and  show  the  applica¬ 
tion  to  several  problems. 

This  paper  is  grouped  into  four  sections.  Section  1  is  the  introduction.  In  section  2,  we  give 
some  results  from  the  calculus  of  variations  that  are  relevant  to  blending  surfaces.  In  section  3,  we 
apply  these  results  to  some  simple  blending  surfaces.  In  section  4,  we  analyze  the  results  obtained  and 
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close  with  some  suggestions  for  implementation  and  for  future  work. 


2.  RELEVANT  NOTIONS  FROM  THE  CALCULUS  OF  VARIATIONS  ■ 

The  calculus  of  variations  was  largely  developed  by  Euler  and  Lagrange  in  the  seventeenth  cen¬ 
tury.  It  is  concerned  with  the  problem  of  obtaining  a  curve  or  surface  which  maximizes  or  minimi*^ 
some  function  of  the  curve  or  surface.  In  the  case  of  a  curve,  the  problem  to  be  solved  generally 
requires  minimizing  or  maximizing  the  functional 
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where  F  is  some  function  of  x,  the  unknown  curve  y  ■  y(x)  and  y  represents  the  derivative  of  y  with 
respect  to  x.  For  simplicity,  the  limits  of  integration  have  been  omitted.  The  solution  to  this  problem 
must  satisfy  the  well-known  Euler  differential  equation 


These  results  can  be  extended  to  problems  involving  surfaces  and  higher  derivatives;  in  fact,  we  will 
use  the  extension  to  functions  of  surfaces  later.  A  huge  number  of  papers  have  been  written  on  this 
subject  The  text  [1]  is  an  excellent  reference  for  the  theoretical  background  needed  for  a  complete 
understanding  of  the  subject  as  used  here  as  well  as  derivations  of  the  major  formulas.  We  will  be  con¬ 
cerned  with  two  uses  of  the  calculus  of  variations  with  regard  to  the  problem  of  finding  the  surface  of  I 

minimum  area  with  certain  restrictions. 

The  first  problem  we  consider,  is  die  following  [1,  p20-21].  Among  all  smooth  curves  which  pass 
through  two  fixed  points  P  and  Q,  find  the  one  for  which  the  area  of  the  surface  of  revolution  formed 
by  revolving  the  curve  about  the  x  axis  is  a  minimum.  This  leads  us  to  the  problem  of  finding  a  func¬ 
tion  y  «y(x)  whose  graph  passes  through  P  and  Q  and  which  minimizes  the  surface  area  given  by  the  I 

integral 

J\y]  *  j y  V  1  +  dy/dx 2  dx  . 

In  this  case,  the  Euler  differential  equation  becomes 

F  -  y  Fj  *  0  .  | 

which  has  the  solution 


y  *=  C  cosh{{  x  +  K)/C ) 


This  function  y  will  be  the  unique  solution  of  the  problem  provided  that  a  single  curve  of  this 
form  can  be  drawn  through  P  and  Q.  There  is  always  a  smooth  solution  provided  that  the  slope  of  the  I 

line  joining  them  is  sufficiently  small;  in  the  limit  as  x2  -  xl  approaches  0,  the  slope  must  approach  0. 

Note  that  the  graph  of  the  function  y(x)  is  a  catenary  which  is  the  shape  of  a  heavy  cable  with  no  extra 
load  and  which  is  acted  only  by  the  force  of  gravity.  The  surface  of  revolution  is  also  well-known;  it  is 
called  a  catenoid. 

In  the  next  example,  we  consider  the  much  more  interesting  problem  of  surfaces  that  are  not  sur¬ 
faces  of  revolution  [1,  p22-24].  For  the  case  where  the  surface  can  b*  written  in  the  form  z  ■  z(x,y), 
the  problem  is  to  find  the  minimum  of  the  functional 

J\y)  ■  V  1  +*,*  +  ty2 

and  thus  Euler’s  equation  has  the  form 

r  (  1  +  q*  -  2spq  +  f  (1  +  p2)  =  0 


p  *  z„  q  -  ty,  r  •  r„,  s  *  z„,  t  =  t„  . 


where 
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The  left  hand  side  of  Euler’s  equation  is  the  numerator  in  the  equation  of  the  mean  curvature  of 
tbe  surffee  and  thus  we  know  that  any  surface  which- minimizesitha  surface  area  must  have  0  mean  cur¬ 
vature.  See  [I]  and  [5]  for  a  discussion  of  the  meaning  of  these  terms  in  differential  geometry.  Note 
that  this  analysis  did  not  make  any  use  of  the  underlying  region  of  integration  and  that  in  particular  did 
not  use  the  boundary  conditions  which  are  necessary  for  a  blending  surface.  (  For  a  discussion  of  this 
topic,  see  [1.  pl73-176]).  In  general,  the  non-linear  partial  differential  equation  is  difficult  to  solve 
explicitly,  even  in  the  simplest  cases.  In  addition,  there  are  often  many  solutions  to  the  partial 
differential  equation  for  given  boundary  conditions  which  are  not  even  local  solutions  to  the  problem  of 
minimizing  the  total  surface  area. 

We  have  two  choices  in  this  situation.  We  can  ask  for  a  complete  solution  of  the  partial 
differential  equation  with  the  appropriate  boundary  conditions  and  require  that  the  solution  also  minim¬ 
ize  the  area.  Of  course,  we  must  expect  that  a  large  amount  of  numerical  computation  will  be  neces¬ 
sary.  Another  choice  is  to  find  simple  solutions  to  the  Euler  equation,  even  though  we  ignore  the  boun¬ 
dary  requirements.  This  method  will  provide  blending  surfaces  with  0  mean  curvature,  but  is  unlikely 
to  actually  minimize  the  surface  area. 

In  this  paper,  we  elect  the  second  option.  We  wish  to  use  as  our  solution  of  Euler’s  equation  the 
following  surface  called  Enneper’s  surface  by  Rassias  [8,  p429].  We  will  change  the  notation  of  [8] 
slightly  to  write  the  surface  as  a  set  in  x-y-z  space  with  the  surface  being  the  image  of  die  disk 
u2  +  v2  <  r  in  the  u-v  plane.  The  surface  is  given  by  the  equations 

X  ■  V  +  «V2  -  -ill* 

3 

2  1  3 

y  m  -v  -14  V  + 

X  rn  u2  -  V2 

As  is  clear  from  figure  1,  the  surface  is  simply  connected  as  a  subset  of  R3  and  is  simple  (not 
self-intersecting)  for  r  <  VJ  [8].  Consequently,  it  is  simple  for  the  unit  square,  which  is  a  smaller  set 
(figure  2).  It  is  easy  to  check  that  this  surface  satisfies  the  Euler  equation  and  thus  has  0  mean  curva¬ 
ture. 


3.  APPLICATIONS 

In  die  previous  section,  we  considered  the  problem  of  determining  minimal  surfaces  with  certain 
restrictions.  We  obtained  a  complete  solutions  only  for  surfaces  of  revolution.  For  other  surfaces,  we 
were  able  to  obtain  the  restriction  .of  the  surface  having  0  mean  curvature  and  gave  an  example  of  a 
simplex-surface1  with  this' property.  In  this  section,  we  apply  these  results  to  a  few  mrampi^  of  simple 
blending  surfaces. 

Our  first  example  is  that  of  a  cylinder  of  revolution  intersecting  a  plane  surface  in  a  right  angle. 
Id  this  situation,  we  have  surfaces  of  revolution  and  hence  we  may  use  the  catenoid  surface  of  the  pre¬ 
vious  section,  hi  figure  3  we  show  the  two  surfaces  without  blending  and  in  figure  4  we  use  a  Mwwtiwg 
surface.  The  blend  is  a  catenoid  obtained  by  rotating  the  curve 

y  ■  C  cosh((  x  +  K)IC) 

about  the  x-axis.  The  local  coordinates  (-K,C,0)  are  chosen  for  the  intersection  with  the  plane  x  «  0 
and  the  coordinates  (0,C,0)  are  chosen  for  the  intersection  with  the  cylinder. 
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Our  second  example  is  typical  of  many  problems  in  this  field  in  that  simple  geometric  surfaces 
may  lead  to  somewhat  complicated  blending  surfaces.  We  consider  the  case  of  two  right  cylinders  of 
equal  radius  intersecting  at  a  right  angle.  In  figure  5,  the  two  cylinders  are  sketched  and  their  intersec¬ 
tion  is  given  in  bold.  In  figure  6,  we  use  a  blend  consisting  of  portions  of  Enneper’s  surface. 
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4.  ANALYSIS  OP  RESULTS  OBTAINED 


We  hate  presented  two  surfaces-  ® be  used  in  blending  problems.  The  primary  goal  was' to 
minimize  the  surface  area  of  the  blending  surface  with  an  easily  computed  surface.  Use  of  these  blend¬ 
ing  surfaces  have  been  demonstrated  in  two  examples  of  somewhat  different  nature.  It  is  reasonable  to 
ask  bow  these  surfaces met  tbe.jisual  criteria. fqttrtending  surfaces..  That  is,  are  they, smaH  perturba- 
'  tton si  easily  computed,  geometrically  significant,  and  well-suited  to  control  by  a  user  of  a  solid  model¬ 
ing  system? 

We  first  consider  the  question  of  speed.  Using  the  cubic  surface  of  Enneper  described  in  this 
paper  requires  the  computation  of  a  cubic  polynomial  in  two  parameters  u  and  v.  For  the  blending  of 
two  quadratic  surfaces,  this  is  slightly  Caster  than  the  computation  of  the  general  fourth  degree  blending 
surface  that  is  generated  by  the  projective  method  of  Hoffman  &  Hopcroft  or  the  fourth  degree  surface 
generated  by  the  method  of  Middleditch  &  Sears.  It  is  certainly  faster  than  the  higher  degree  surfaces 
that  have  been  used  by  other  authors.  For  higher  degree  surfaces,  our  method  still  uses  a  cubic  blend¬ 
ing  surface  while  these  other  methods  produce  higher  degree  blends.  In  addition,  our  method  works  for 
non-polynomial  surfaces  whereas  the  projective  method  does  not 

For  situations  such  as  in  the  first  example  considered  in  this  papa,  blending  surfaces  of  revolu¬ 
tion  are  useful.  At  first  glance,  they  appear  to  be  highly  complex  surfaces  which  require  extensive 
computation  to  plot  However,  the  catenoid  given  here  depends  on  two  parameters  C  and  K  which 
depend  cm  the  points  chosen  as  controls.  We  could  have  used  a  lookup  table  for  the  values  of  the 
catenary  which  is  to  be  rotated.  This  speeds  up  the  computation  of  the  blending  surface  but  may  cause 
a  large  time  penalty  if  ray  tracing  is  to  follow  the  blend  because  of  the  time  needed  to  compute  the 
intersections  between  the  surface  and  rays  of  light  This  excess  computation  may  be  reduced  by  replac¬ 
ing  the  hyperbolic  cosine  by  the  first  few  terms  of  its  Taylor  expansion  about  the  origin  as  in  figure  7. 
We  note  the  similarity  between  die  surfaces  in  figure  4  and  figure  7  .  This  is  an  example  of  another 
phenomenon  in  computer  graphics  in  which  several  different  approximations  of  a  surface  give  similar 
pictures. 


In  the  more  difficult  case  where  we  do  not  have  surfaces  of  revolution,  the  more  general  type  of 
blending  surface  is  needed.  Here  we  use  a  cubic  surface  which  is  simple  to  compute.  It  often  requires 


a  certain  amount  of  patching  bat  this  is.  unavoidable  in  many  instances. 

.  .  In.eacbaf.the  situations  discussed  in  .  this  paper,  we  obtained  blending  surfaces  that,  were  small 

pertnrbatioos/easily  computed  and  well  suited  to  control  by  a  user  of  a  geometric  modeling  system.  In 
the  first  case  of  surfaces  of  revolution  blending  into  another  surface  of  revolution  about  the  same  axis, 
the  result  minimizes  surface  area.  In  the  second,  more  general  situation,  the  proposed  blend  is  a  solu- 
•  tioa  of  'Eider'S  tapJhrioa  but '  nfcfcf  hof  actually  nrinimfee'  the  surface  ~area/  ’  ‘ '  *■  V  ' 

Unfortunately,  these  blending  surfaces  are  not  tangent  to  the  given  surfaces  in  general.  We  may 
elect  to  either  accept  these  non- tangential  blending  surfaces  as  is  or  to  reblend  them  near  the  points  erf 
intersection  of  the  blending  surface  and  the  given  surface.  A  ’  reblend’  in  this  case  would  involve  treat¬ 
ing  the  blending  surface  as  a  small  perturbation  and  then  multiplying  the  values  of  the  blend  by  a  small 
parameter  approaching  0  rapidly.  Perhaps  the  use  of  lookup  tables  for  the  exponential  function  and  the 
catenary  far  values  of  the  variable  x  would  be  appropriate  in  this  context 


An  additional  problem  with  most  existing  approaches  to  blending  surfaces  is  that  the  geometry  of 
such  surfaces  is  not  always  well  understood  or  related  to  the  ’reality*  of  the  situation.  In  particular, 
many  winding  algorithms  when  applied  to  even  simple  examples  such  as  quadric  surfaces  which 
describe  solid  objects,  cannot  be  guaranteed  to  lie  entirely  outside  of  the  existing  objects  in  the  case  of 
external  blends.  A  similar  problem  also  occurs  for  internal  blends.  Future  work  will  consider  this 
problem  as  well  as  the  nature  of  the  reblends  mentioned  earlier. 
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L  INTRODUCTION 

A  major  problem  in  solid  modeling  systems  occurs  when  two  solids  intersect  Generally,  it  is  not 
sufficient  to  simply  compute  the  intersection  of  the  two  surfaces  that  bound  the  sol'  U.  It  is  often  more 
important  to  replace  the  two  surfaces  near  their  intersection  by  a  blending  surface  which  allows  a 
smooth  transition  between  the  surfaces.  Blending  surfaces  should  have  the  properties  of  being  :  easily 
computed,  well  suited  to  being  accessed  by  other  software  in  the  solid  modeling  system,  and  being 
geometrically  accurate.  Many  papers  have  been  written  on  the  subject  of  blendinv  -urfaccs  using  a 
variety  of  techniques;  [4],  [5],  [10],  [11],  and  [12]  are  somewhat  typical. 

Some  of  these  papers  consider  the  blending  surface  as  the  final  step  in  a  "^vesentation  and 
display  process.  They  emphasize  the  aesthetic  appeal  of  the  blending  surface  and  are  somewhat  less 
concerned  with  the  analytic  and  computational  properties  of  the  blend.  The  blend  surfaces  obtained  are 
typically  appealing,  but  are  often  difficult  to  integrate  rapidly  into  the  other  portions  of  the  solid  model¬ 
ing. system  because  of  the  high  algebraic  degree  of  the  surfaces  obtained.  Other  papers  (e.g.  [11]) 
emphasize  the  integration  into  a  solid  modeling  system  but  do  not  emphasize  geometn  .spects. 

Other  techniques  for  blending  surfaces  emphasize  the  analytic  requirements  for  rapid  computation 
while  not  emphasizing  the  global  geometric  properties  of  die  surface.  An  example  of  this  is  the  projec¬ 
tive  method  of  Hopcroft  and  Hoffman  [4]  which  computes  the  surface  of  minimal  decree  which  is 
nngent  to  certain  surfaces  in  prescribed  curves.  Their  method  gives  the  blending  surface  of  minimal 
degree  which  is  tangent  to  the  given  surfaces  in  the  prescribed  curves,  but  no  conditions  on  die 
geometry  of  the  surfaces  is  given. 

The  purpose  of  this  note  is  to  study  the  applicability  of  blending  surfaces  which  computation¬ 
ally  tractable  and  which  are  also  "minimal  surfaces”.  Minimal  surfaces  have  a  particular  property 
which  is  related  to  but  not  always  identical  with  minimizing  surface  area;  the  exact  relationship  and  the 
mathematical  foundation  is  discussed  in  section  2.  This  minimizing  property  is  very  desirable  in  a  solid 
modeling  system  since  the  goal  of  Mending  surfaces  in  such  a  system  is  to  provide  a  smooth  transition 
between  solid  objects  which  can  be  efficiently  implemented  by  automated  manufacturing  tools  such  as 
milling  systems. 

In  this  context,  computationally  tractable  generally  means  that  the  surface  is  of  low  degree.  In 
this  paper,  we  will  consider  those  surfaces  which  are  both  computationally  tractable  and  geometrically 
significant' 

The  paper  is  organized  as  follows.  In  the  next  section,  die  background  results  and  terminology 
for  the  discussion  of  minimal  surfaces  is  given.  In  section  3,  we  consider  surfaces  erf  degree  2  and 
show  that  the  only  minimal  surfaces  of  degree  2  are  planes.  It  would  be  natural  to  perform  the  same 
ac'dysis  for  surfaces  of  degree  3  and  4  by  considering  all  of  the  possible  cases.  Ho*.;vcr,  there  are  99 
possible  types  of  surfaces  and  many  of  these  types  involve  so  many  coefficients  that  analysis  is  beyond 
the  limits  erf  the  symbolic  computation  program  MACSYMA  (and  hence  beyond  the  limits  of  any 
human  performing  the  algebraic  computations).  Thus  in  section  4  we  consider  an  alternate  representa¬ 
tion  for  minimal  surfaces.  In  section  5  we  show  some  examples  of  the  use  of  minimal  blending  sur¬ 
faces  and  the  effects  of  certain  degrees  of  freedom  on  the  behavior  of  the  surfaces.  Section  6  provides 
a  summary  of  results.  The  ~aper  closes  with  a  discussion  of  some  open  problems. 
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2.  MINIMAL  SURFACES 

A  minimal  surface  is  defined  as  the  set  of  all  points  (xyj)  which  satisfy  the  second  order  J 


differential  fqimHwi 

r(l  +  q2)  -  2spq  +  r(l  +P2). 

(2.1) 

where 

P  =  *1  <  ?  8  *J|  =  ta  •  S  ~  »  t  =  Xyy. 

(2.2) 

In  differential  geometry  ([1],  £2],  [7],  £91),  it  is  well-known  that  the  vanishing  of  the  left  hand  r*.  of 
this  equation  implies  that  the  mean  curvature  of  the  surface  is  0. 

The  partial  differential  equation  is  obtained  as  the  solution  to  a  "variational  problem"  of  minimiz¬ 
ing  the  surface  area.  An  infinite  number  of  functions  satisfy  the  equation;  this  is  not  surprising  suy*-  uo 
boundary  conditions  are  given.  Unfortunately,  even  when  complete  boundary  conditions  are  given  in 
the  form  of  requiring  that  the  surface  must  pass  through  some  boundary  curve,  there  may  be  many  solu¬ 
tions,  not  all  of  which  actually  minimize  the  surface  area  over  the  class  of  all  curves  which  pass 
through  the  given  boundary  curve.  The  reader  should  think  of  soap  bubbles  as  models  of  surfaces 
which  minimize  surface  area. 

A  word  on  the  methodology  used  in  this  paper  is  in  order.  As  was  indicated  in  this  section,  exten¬ 
sive  use  is  made  of  the  symbolic  manipulation  program  MACSYMA  to  actually  compute  the  low 
degree  surfaces  which  are  also  minimal  surfaces.  MACSYMA  is  a  trademark  of  Symbolics,  Inc.  The 
computations  were  performed  on  a  SUN  2/120  workstation  running  UNIX.  The  workstation  had  a  phy¬ 
sical  memory  of  4MB  .and  a  virtual  memory  of  approximately  20.1MB.  The  original  goal  of  this 
research  was  to  characterize  all  of  die  minimal  surfaces  which  were  of  low  degree  by  making  use  of 
MACSYMA  far  the  computations.  We  will  see  in  the  next  section  that  extensive  analyses  and 
simplifications  had  to  be  done  in  order  to  be  able  to  use  MACSYMA  efficiently  and  not  exceed  the  lim¬ 
itations  of  the  workstation. 

Of  course,  the  limitations  of  MACSYMA  and  the  memory  of  the  workstation  are  relevant  only 
during  the  characterization  of  those  surfaces  which  are  actually  minimal  surfaces.  The  actual  computa¬ 
tion  of  appropriate  minimal  blending  surfaces  in  applications  is  quite  rapid,  once  the  preliminary  ana¬ 
lyses  have  been  made. 

3.  SURFACES  OF  DEGREE  2 

The  most  general  surface  of  degree  2  is  given  by  the  implicit  equation 

FQcyj)  =  Ax2  +  Bxy  +  Cy 2  +  Dxz  +  Eyx  +  Fz2  +  Gx  +  Hy  +  It  +  /  *  0  (3.1) 

This  equation  involves  10  constants  of  which  9  may  be  arbitrarily  chosen. 

The  goal  of  this  paper  is  to  characterize  the  low  degree  minimal  blending  surfaces.  A  brute  force 
computation  of  the  result  of  the  differential  operator  of  formula  (2.1)  on  F(xyj)  using  implicit 
differentiation  leads  to  an  expression  which  requires  six  pages  to  print  and  which  cannot  be  factored 
directly  within  the  memory  limitations  of  the  computer,  dearly  a  amplification  was  needed,  especially 
since  the  intention  bf  this  research  was  to  classify  the  potential  minimal  blending  surfaces  of  low 
degree.  The  only  possibility  is  to  consider  classification  of  the  various  surfaces  that  can  arise  from  die 
expression  FQcyj).  One  obvious  simplification  is  to  observe  that  all  of  die  linear  terms  can  be  rV’tt- 
inated  by  translation.  It  is  less  dear  that  that  all  of  the  mixed  toms  can  be  eliminated  by  appropriate 
rotations;  the  original  observation  of  this  was  apparently  due  to  Euler.  We  note  that  translations  and 
rotauons  leave  the  minimal  surface  equation  invariant  We  will  use  the  invariance  of  the  minimal  equa¬ 
tion  frequently  without  mention  while  considering  possible  simplifications  of  various  expressions. 

R  moval  of  tie  mixed  terms  leaves  several  possibilities;  the  variable  z  does  not  appear  and  thus 
the  surface  is  of  the  form 


Ax2  +  By2  =  C  (3  2) 

which  is  an  elliptic  cylinder,  hyperbolic  cylinder,  or  two  planes;  the  variable  z  appears  to  only  the  first 
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power  and  thus  the  surface  has  the  equation 

z  =  Ax2  +  By2  +  C 


(33) 


which  is  either  a  paraboloid  (elliptic  or  hyperbolic)  or  a  parabolic  cylinder,  or  the  variable  z  appears  to 
the  second  power  and  the  surface  has  the  equation 


1. 


(3.4) 


Here  all  of  the  unnecessary  linear  terms  have  been  eliminated;  none  of  a ,  b  and  c  are  0.  We  consider 
separately  the  cases  where  z  does  not  appear,  where  it  appears  to  the  first  power  and  where  it  appears 
to  the  second  power. 

If  z  does  not  appear  as  in  equation  (33),  then  the  minimal  surface  equation  implies  that  the  sur¬ 
face  must  be  a  portion  of  a  plane. 

If  z  appears  to  the  first  power  only  as  in  equation  (33),  then  the  minimal  surface  equation  yields 

2 A  (4 1)  +  2R  (44V+  1)*0 


which  is  only  possible  for  arbitrary  x,y,  and  z  only  if  A  =  B  *  0  and  therefore  the  surface  is  a  plane. 

When  the  variable  z  appears  to  the  second  power  as  in  equation  (33),  the  minimal  surface  equa¬ 
tion  yields  two  possible  solutions  :  either  the  surface  is  a  horizontal  plane  or  else 


(E-D)Cx2  +  DC3  +P2C 
dc2-de 


(3.6) 


The  first  equation  is  that  of  a  plane.  The  second  equation  (3.6)  implies  a  restriction  on  the  surface  in 
that  its  points  must  also  lie  on  another  surface,  which  as  we  saw  earlier  was  a  paraboloid. 

Thus  we  have  obtained  the  following  result. 

Theorem 


The  only  minimal  solaces  of  degree  2  are  planes. 


4.  HIGHER  DEGREE  SURFACES 

The  most  general  surface  of  degree  3  is  given  by  the  implicit  equation  G(xyx)  =  0  ,  where 

G(xyj) «  Ax3  +  Bx ^  +  Cxy2  +  Dy3  +  Exh  +  Fxz 2  +  Gz3  +  Hyh  +  lyz2  +  F(xjj)  (4.1) 

and  F(xjx)  is  an  arbitrary  second  degree  expression  in  x ,  y,  and  z .  As  before,  the  minimal  equation 
for  this  general  expression  is  too  complicated  to  submit  to  a  symbolic  algebra  program  and  thus  we 
resort  to  some  simplifications.  The  number  of  possible  cases  is  considerably  larger  than  the  number  of 
possibilities  for  degree  2  surfaces.  Salmon  [13]  indicates  23  possible  forms  which  are  classified  accord¬ 
ing  to  what  he  calls  their  "class”  and  "singularities”.  An  examination  of  5  of  these  forms  failed  to  yield 
any  minimal  surfaces  after  considerable  computer  time  per  example  and  hence  an  alternative  approach 
was  used.  Note  that  the  situation  is  even  worse  for  surfaces  of  degree  4.  Salmon  indicates  that  there 
are  76  different  sperms  of  surfaces.  The  references  [3],  [13],  and  [14]  describe  the  possibilities  for  sur¬ 
faces  of  degree  2, 3,  and  4. 

I~  [2],  the  authors  describ*.  a  method  due  to  Weierstrass  in  which  auxiliary  mappings  are  used  to 
find  parametrizations  of  minimal  surfaces.  The  method  is: 

1.  Choose  an  open,  connected  subset  of  the  plane. 

2.  Choose  a  complex-valued  analytic  function  g  and  differential  w  in  some  domain  D  so 
that  the  expressions  <Xi  a*  and  u3  defined  in  (4.4)  -  (4.6)  satisfy 

2#  =  0 
Sla*  I2  >0. 


(43) 

(43) 
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3.  Form  the  expressions  04,  0^,  and  03  by 


al  *  (I 

(4.4) 

a2»i(l  +  g*)“. 

(4.5) 

aj-gw, 

(4.6) 

4.  Integrate  die  expressions  on  the  right  hand  sides  of  equations  (4.4),  (4.5),  and  (4.6)  from  0  to 
r.  Replace  2  by  n  +  iv.  To  obtain  the  parametrization  of  the  surface,  set  x,  y,  and  2  to  be  the  real 
pans  of  the  integrals. 

It  is  easy  to  see  that  if  each  of  the  a*  is  constant,  then  the  surface  is  a  plane.  In  addition,  if  g  is 
constant,  then  the  expressions  in  equations  (4.4)  and  (4.6)  are  proportional  to  one  another  and  hence  the 
surface  lies  in  a  plane. 

The  choice  j«i,w*1  leads  to  the  surface  known  as  Enneper's  surface  [2]  which  is  described 
by 


f— *• 

(4.7a) 

— 

3  * 

(4.7b) 

u2-v2 . 

(4.7c) 

This  surface  appears  to  be  of  high  degree  even  though  no  terms  in  the  parameterization  have  degree 
higher  than  3;  MACSYMA  indicates  a  degree  of  27  when  using  the  "eliminate'*  command. 

The  choice  g  *  2,  w  *  r  leads  to  a  surface  described  by  the  equations 

h  9  V2  V4  -  6u*V2  +  U* 

X  a  —  1  , 

4  g 

h3v  -  «v3  uv 

ys~  2  2  ’ 

a3  -  3 mv2 
*  *  3 

The  result  from  a  MACSYMA  computation  to  eliminate  the  parameters  and  write  in  implicit  form  is 
even  more  complicated;  it  takes  1288  lines  to  display  and  has  a  degree  of  at  least  32. 

It  is  dear  that  minimal  surfaces  (except  for  planes)  are  either  of  high  algebraic  degree  or  not 
algebraic.  Based  on  these  observations,  we  simply  present  several  well-known  minimal  surfaces  by  their 
parametric  equations  but  will  not  attempt  to  write  them  in  implicit  form. 

It  seems  likely  from  the  above  discussion  that  there  are  only  two  minimal  surfaces  of  low 
parametric  degree.  However,  each  of  these  surfaces  has  additional  degrees  of  freedom.  The  constant 
expression  w  =  1  can  be  replaced  by  the  complex  constant  e  +  fi.  The  parameters  e  and  /  control  the 
rotation  and  scaling  of  the  surface.  The  effect  of  the  other  degrees  of  freedom  is  more  striking.  Each 
term  linear  in  2  can  ^  replaced  by  a  term  of  the  form  A  +  Bz,  where  A  and  B  are  complex;  this  pro¬ 
vides  four  additional  degrees  of  freedom  in  the  surfaces.  The  effect  of  these  additional  degrees  of  free¬ 
dom  will  be  discussed  in  the  next  section. 


The  next  three  equations  show  bow  the  original  parametric  equations  of  Enneper’s  surface  (4.7a 
through  4.7c)  are  influenced  by  the  extra  degrees  of  freedom. 

(1  -  a2  +  b*)u  +  2 abv  _  (ac  -  bd)(u2-  v2)  -  (ad  -  bc)(2uv) 

X  =  2  2 
c(a3  -  3UV2)  -  dOuh  -  v3) 

6 

(1  +  a3  -  '  +  Tabu  _  {ac  -  bd)2uv  +  (ad  4-  bcX«2  -  v2) 

7 "  2  2 


(4.7d) 


(4.7e) 
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z  =  au  -  bv  + 


(4.70 


cQu?v  -  v3)  +  d(u2  -  Suv2) 

6 

c(u2  -  v2)  -  2duv 
2 

This  same  possibility  of  increasing  the  degrees  of  freedom  holds  in  all  of  the  other  surfaces  considered 
in  this  section  to  an  even  greater  degree. 

The  catenoid  arises  from  the  choices  g  =  e*,  w  -  -e~*.  which  leads  to  the  parametrization 

x  =  cos(v)cos(u)  -  1  , 
y  =  sin(v)cosh(u) , 
z  *  u  . 

A  minimal  surface  due  to  C.  C.  Chen  [2]  is  given  by  g  =  z  +  — ,  w  =  z2  and  has  the  parametri¬ 


zation 


lOn^2  -  5uv4  -  u5  3 uv2  -  a3  _  u_ 

X  =  10  6  "  2  ’ 
-v3  +  lOuV  -  5k4v  -  v 

* - 10 - + - 2 - * 

u4  -  fiuV  +  V4  U2  -  V2 


The  final  surface  that  we  present  here  is  due  to  Jorge  and  Meeks  [6].  It  is  obtained  by  choosing 
g  *  z  and  w*(z“  * 1  -  1)~2  and  leads  to  the  parametrization  where  the  coordinates  ,  y ,  and  z  are  the 
integrals  of  the  real  parts  of  the  expressions  a,-  given  by 

«  -  1-** 

= 


2(z"+1-l)2  ’ 
_  .  (1  +  z2*) 

°*  *2(z"  +  l-  if  ’ 

2m 

2 

a,® 


♦  1 


-i)2 


(4.8a) 

(4.8b) 

(4.8c) 


5.  ANALYSIS  OF  THE  SURFACES 

The  goal  of  blending  surfaces  is  to  provide  a  transition  between  surfaces.  A  blending  surface 
should  be  easy  to  generate,  appear  in  a  form  which  can  be  computed  rapidly,  and  interface  well  with 
the  solid  modeling  system  in  which  it  is  being  used.  The  s’Tface  should  be  aesthetically  pleasing  and 
represent  only  a  small  amount  of  additional  material  if  the  actual  solid  is  to  be  created  by  use  of  a 
computer-guided  milling  machine.  The  use  of  the  few  fixed  blending  surfaces  described  in  this  paper 
certainly  meets  die  criterion  of  being  easy  to  generate  since  they  are  selected  from  a  small  list  They 
are  presented  here  by  their  parametric  representation  and  therefore  can  be  computed  rapidly,  perhaps 
not  even  requiring  the  use  of  any  pawbes.  They  share  the  sane  difficulty  as  any  parametric  representa¬ 
tion  in  that  intersections  with  other  surfaces  defined  parametrically  are  much  harder  to  compute  than  if 
the  surfaces  were  described  explicitly.  They  ah  -  allow  a  certain  amount  of  possible  interaction  with  a 
desig-'T  in  the  sense  that  some  of  parameters  which  provide  the  degrees  of  freedom  can  be  altered  to 
change  the  surface.  We  will  illustrate  this  in  detail  for  Ennep ex’s  surface  which  was  discussed  in  die 
previous  section;  the  parameters  u  and  v  will  be  restricted  to  the  unit  square.  In  [8],  this  surface  was 
used  for  blending  two  intersecting  cylinders  of  equal  radius.  The  graph  of  Enneper’s  minimal  surface  is 
given  in  figure  1. 


Figure  1  Enneper’s  surface 

The  effects  of  the  parameters  a.  b,  c,  and  d  of  formulas  (4.7d,  4.7e,  and  4.7f)  on  the  graph  of  the 
surface  ate  shown  in  figures  2*21.  The  figures  are  grouped  by  varying  each  of  the  parameters  singly 
followed  by  consideration  of  several  cases  in  which  the  parameters  are  allowed  to  vary  together. 

The  first  case  that  we  consider  is  when  the  parameters  b  £ ,  and  d  are  all  0  and  the  parameter  a 
varies.  In  this  case,  the  resulting  surface  obtained  from  tee  original  Enneper  surface  is  always  a  plane. 
The  graphs  are  shown  in  figures  2-4. 


Figure  2  a  *  1,  b~c**d*0. 


Figure  6  b  *  2,  a*c*d*0. 

The  effects  of  die  parameter  c  on  tf  g*aph  is  more  interesting,  at  least  in  the  range  that  we  are 
showing  here.  The  surface  displays  a  twist  which  was  not  obvious  in  the  original  Enneper  surface  (the 
case  a=b=dO,  c  =  l)  which  was  shown  in  figure  1.  The  results  are  shown  in  figures  7-9. 


Figure  7  c  =  2,  a-b=d=0. 


Figure  9  c  =  4,  a=b=d*0 


Figures  1-12  indicate  the  effect  of  changing  the  parameters  one  at  a  time.  The  next  case  to  con¬ 
sider  is  when  the  parameters  are  varied  two  at  a  time.  The  simplest  situation  occurs  when  c  and  d  are 
both  0.  An  examination  of  the  effect  of  the  parameters  a  and  b  on  the  parametric  form  of  the  surly* 
given  in  equations  (4.4),  (4.5)  and  (4.6)  for  Enneper’s  surface  shows  that  the  vanishing  of  c  and  d 
implies  that  the  surface  is  a  plane.  For  this  reason,  we  omit  the  graphs  in  this  situation. 

In  figures  13-20,  we  show  the  result  of  some  variations  in  the  parameters  a  and  c. 


i 


Figure  13  a  *  l.c  « 


Figure  14  a  *  1,  c  = 


15  a  *  /  ,  c  ■  3.  b*d=0. 


MMSa 


Notice  that  the  level  corves  in  the  upper  left  comer  of  the  graph  in  figure  20  are  nearly  straight, 
indicating  that  this  type  of  surface  could  be  useful  for  blending  where  one  of  the  surfaces  is  a  plane. 


The  last  variation  Of  this  surface  that  we  give  in  this  paper  is  shown  in  figure  21.  It  uses  the 
values  a  ■  2  and  c  *  3  with  b  and  d  being  0.  Many  of  the  other  choices  of  a,  b, c ,  and  d  in  the 
ranfe  of  integers  0 ..  4  lead  to  surfaces  which  have  cusps  and  are  therefore  unsuitable  for  blending  snr- 


Figure  21  a  *  2,  d  «  3,  b~c~0. 


The  surfaces  given  in  figures  1*21  indicate  some  of  the  modifications  that  can  be  made  to  a  stan¬ 
dard  surface  using  certain  degrees  of  freedom  which  can  be  described  by  the  parameters  a,  b,  c,  and 
d.  Additional  degrees  of  freedom  are  available  in  the  parameters  e  and  /.  These  two  parameters  were 
not  used  here  because  in  this  example,  they  simply  cause  rotation  and  scaling  of  the  surface. 

The  other  potential  blending  surface  that  we  show  here  is  the  Jorge-Meeks  surface.  This  surface 
is  useful  in  blending  together  n  cylinders  of  equal  radius  which  wmwum*  from  the  center.  The 
graph  of  the  surface  is  presented  here  only  in  the  case  n>3.  It  originally  appeared  in  [2]. 
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t.  SUMMARY  OF  RESULTS 

The  motivation  for  this  p%«r  was  the  use  of  gemetric  considerations  in  the  development  and 
implementation  of  blending  algorithms.  As  was  indicated  earlier,  such  constraints  have  typically  not 
been  incorporated  into  die  blending  portions  of  solid  modeling  systems.  The  geometrical  constraint  that 
we  have  considered  here  is  mn  "arizing  the  surface  area  of  a  blending  surface.  This  minimization  of 
surface  area  was  approximated  by  the  use  of  a  solution  to  the  minimal  surface  equation  (2.1)  and  a 
solution  of  the  equation  was  called  a  minimal  surface. 

The  simplest  method  of  generating  minimal  surfaces  was  presented  and  examination  of  the 
method  indicated  some  prevk.  unknown  degrees  of  freedom  in  the  minimal  surfaces.  The  effect  of 
certain  combinations  of  these  degrees  of  freedom  on  the  surfaces  was  indicated  by  several  examples. 

The  results  here  can  be  incorporated  into  a  solid  modeling  system  in  the  following  manner.  The 
user  specifies  the  solids  to  br  Landed  and  (if  desired)  the  curves  on  the  solids  through  which  the 
required  blend  is  to  pass.  The  user  then  selects  the  type  of  fundamental  blending  surface:  Ennep-x, 
catenoid.  Jorge-Meeks,  Chen,  etc.  The  choice  is  made  according  to  the  experience  of  the  user  in  actu¬ 
ally  using  the  particular  surfaces.  The  user  then  selects  the  appropriate  degrees  of  freedom  (a..f  in  the 
case  of  Enneper’s  surface)  and  selects  values  from  a  valuator  device.  The  blends  are  then  sketched 
using  a  large  value  of  the  increment  so  as  to  be  able  to  rapidly  reject  any  obviously  inappropriate 
blends. 

The  number  of  degrees  of  freedom  is  considerably  less  than  the  number  of  degrees  of  freedom  in 
general  even  for  low  degree  surfaces;  Salmon  described  23  "species”  for  the  general  surface  of  degree  3 
which  of  course  has  many  coefficients.  The  situation  is  much  worse  for  surfaces  erf  degree  4  and  hence 
our  method  makes  the  problem  much  more  tractable  by  limiting  the  choices.  The  number  of  degrees  of 
freedom  can  often  be  reduced  even  more  using  our  method  by  noring  that  certain  of  the  parameters 
represent  rotation  and  scaling  of  the  surface  ( e  and  /  in  Enneper’s  surface)  and  that  not  all  parameters 
can  be  0.  The  small  number  of  degrees  of  freedom  is  appropriate  for  matching  the  blending  surface  to 
the  values  of  specific  points  on  the  surfaces  to  be  blended. 

7.  OPEN  PROBLEMS 

The  analysis  given  here  concentrates  on  minimal  surfaces  which  blend  together  two  given  sur¬ 
faces.  No  requirement  has  been  made  of  tangency  to  the  given  surfaces.  Incorporation  of  tangency 
information  would  require  use  of  mgrange  multipliers  in  the  analysis.  It  is  expected  that  the  blending 
surfaces  will  only  approximate  the  actual  tangency  except  in  rare  circumstances  where  the  tangency  is 
exact  The  primary  concern  o  aus  paper  was  the  use  of  blending  surfaces  which  minimized  surface 
area.  However,  in  a  typical  milling  operation,  the  volume  of  excess  material  is  also  important  (espe¬ 
cially  if  the  material  is  expensive). 

Minimizing  volume  is  generally  impossible  unless  the  blending  surface  actually  follows  the  sur¬ 
faces  to  be  blended  exactly,  in  which  case  use  of  a  blending  algorithm  is  obviously  pointless.  It  is  pos¬ 
sible  to  consider  volume  minimization  over  all  surfaces  of  a  fixed  degree,  both  with  or  without  con¬ 
sideration  of  tangency  to  the  surfaces  to  be  blended.  We  intend  to  return  to  this*  subject  in  a  future 
paper. 

The  blending  surfaces  described  in  this  paper  have  relatively  simple  parametric  representations 
yet  lead  to  implicit  equations  of  very  large  algebraic  degree  or  to  non-algebraic  sir '‘aces.  It  would  be 
useful  to  be  able  to  obtain  a  mi.  n»'  surface  of  low  algebraic  degree  or  to  verify  that,  none  are  possible. 
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ABSTRACT 

Many  students  have  great 
difficulty  understanding  concurrent 
programming  at  anything  but  the  most 
superficial  level.  In  this  paper,  we 
describe  some  experience  teaching 
concurrent  programming  in  Ada  and  give 
some  suggestions  for  implementing  the 
ideas  discussed  here. 


1.  INTRODUCTION: 

The  concept  of  concurrent 
programming  is  one  of  the  most 
difficult  ones  for  students  to 
understand.  In  general,  upper  level 
students  have  a  good  understanding  of 
structured  design,  analysis  of 
algorithms,  and  a  rudimentary 
knowledge  of  software  engineering. 

Such  students  generally  are  proficient 
in  several  diferent  programming 
languages  that  are  used  for  sequential 
programs.  In  this  paper,  we  describe 
some  experiences  when  having  students 
learn  about  concurrent  programming  by 
writing  programs  in  Ada.  While  the 
environment  we  describe  is  specific  to 
a  particular  course  at  Howard 
University,  many  of  the  experiences 
encountered  can  be  carried  over  to 
other  environments. 


2.  THE  ENVIRONMENT: 

The  course  in  question  was  an 
advanced  course  in  programming 


languages  taught  to  seniors  and 
graduate  students  at  Howard.  The 
undergraduate  students  were  majoring 
in  Computer  Systems  Engineering  and 
the  graduate  students  were  majoring  in 
Computer  Science;  all  students  were 
majoring  in  the  School  of 
Engineering.  All  students  were 
proficient  in  the  languages  FORTRAN, 
Pascal,  and  C  before  beginning  the 
study  of  Ada.  In  addition,  many  of 
them  were  exposed  to  a  variety  of 
other  languages  including  LISP, 

PROLOG,  or  PL/1. 

The  topic  of  concurrent 
programming  was  introduced  by  means  of 
some  standard  examples.  The  notion  of 
Petri  net  was  presented  and  used  as  a 
framework  for  the  discussion  of 
concurrency  in  tasking.  No  other  more 
formal  model  of  concurrency  was  used. 

This  method  of  presentation  is  in 
agreement  with  the  philosophy  in  [1) 
and  [ 2 ] . 

The  Ada  environment  at  Howard  is 
not  conducive  to  the  development  of 
large  Ada  programs.  Most  of  the  Ada 
assignments  are  done  on  a  Digital 
Equipment  Corporation  VAX  1/780  with 
4MB  of  main  memory  which  runs  under 
VMS.  The  "compiler"  is  the  classic 
NYU  AdaEd.  Because  of  the  size  of  the 
working  set  needed  for  compilation 
under  this  compiler,  students  were 
encouraged  to  submit  jobs  for  batch 
processing.  Interactive  compilations 
and  executions  were  limited  to  one 
terminal  at  a  time,  since  working  sets 
of  2MB  were  allocated  to  Ada 
processes.  In  addition,  students 
learned  Ada  syntax  by  writing  small 
Ada  programs  using  the  limited  Janus 
Ada  on  personal  computers.  There  were 
21  students  in  the  class.  A  much 
larger  class  would  have  been 
unmanageable  for  this  project.  Note 
however  that  the  assignment  of 
different  projects  eliminated  the 
propagation  of  correct  solutions  since 
each  of  the  students  had  a  tasking 
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synchronization  problem  that  was  at 
least  superficially  different  from 
that  of  the  other  students.- 


3.  THE  PROJECTS: 

The  students  were  each  assigned  a 
project  for  which  they  were  required 
to  write  Ada  programs  which  involved 
at  least  two  tasks.  In  general, the 
tasks  embodied  some  simple  idea  that 
the  students  were  very  familiar  with 
at  least  in  the  case  of  sequential 
programs.  Thus  the  difficulty  was  in 
understanding  the  concurrency  and  not 
in  the  computation  performed  by  the 
individual  task. 

As  an  example,  one  student  was 
asked  to  write  a  program  with  two 
tasks  -  sort  an  array  of  integers  and 
then  search  for  a  key  using  a  binary 
search.  The  student  was  allowed  to 
use  any  sorting  algorithm.  Thus  the 
student  did  not  have  difficulty 
implementing  the  individual  algorithms 
for  the  tasks.  The  troublesome  part 
was  the  implementation  of  the 
synchronization  or  communication  of 
the  tasks. 

Projects  involving  similar,  but 
not  identical  problems  in 
synchronization  were  given  to  other 
students.  Some  examples  of  these 
assignments  are: 

1.  Write  a  program  which  uses 
two  tasks  to  solve  quadratic 
equations  using  the  quadratic 
formula.  Each  task  must  perform  at 
least  three  arithmetic  operations. 

2.  Write  a  program  to  read  an 
integer  n  and  to  have  two  tasks.  The 
tasks  are  to  compute  some  simple 
function  f(n)  and  to  find  all  primes 
less  than  f(n). 

3.  Write  a  program  to  simulate 
the  donning  of  socks  and  shoes. 

Putting  on  socks  and  putting  on  shoes 
are  to  be  separate  tasks. 

4.  Write  a  program  to  read  an 
array  A  of  integers  and  to  have  two 
tasks.  The  tasks  are  to  sort  the 
array  A  in  increasing  order  passing 
this  sorted  array  to  B  and  to  sort  the 
array  B  in  decreasing  order. 

Clearly  the  major  difficulty  for 
the  students  was  the  synchronization 
®f  tasks.  Students  were  required  to 
run  their  programs  four  times  with  the 

input.  Most  of  the  errors  in  the 
^k°®rams  ^ue  to  suhtle  assumptions 
•bout  tasking  made  by  the  programmer 
became  apparent  after  four  runs. 


Student's  observations  on  this 
point  were  interesting.  In  spite  of 
several  lectures  on  timing  and 
synchronization  of  tasks,  lengthy 
discussions  on  the  nature  of  an  Ada 
"logical  processor",  and  numerous 
classroom  examples,  students  did  not 
believe  that  programs  could  give 
different  results  or  bomb  when  given 
the  same  input.  The  sudden  shock  when 
their  program  shoved  this  behavior  put 
the  point  across  better  than  any 
lecture  could.  Many  of  the  students 
indicated  that  they  had  seen  this  kind 
of  error  at  some  time  during  program 
development.  Two  of  the  students  were 
so  shocked  by  the  different  behaviour 
of  the  sample  runs  that  they  turned  in 
their  projects  with  signatures  of 
witnesses  that  their  programs  ran 
successfully,  at  least  once. 


4.  CONCLUSIONS  AND  SUG  ESTIONS  FOR 
IMPLEMENTATION 

The  students  who  had  been  through 
this  assignment  seemed  to  have  a 
better  understanding  of  concurrent 
tasking  in  Ada  than  did  students  in 
previous  semesters.  The  assignment  of 
programs  involving  several  (perhaps 
trivial)  tasks  which  needed  to  be 
synchronized  or  communicated.  This 
can  be  implemented  in  several  ways. 

1)  Assign  different  projects  to 
students  (or  to  small  groups  of 
students)  requiring  them  to  have  at 
least  three  or  four  runs  of  their 
program. 

2)  Assign  the  same  project  to 
different  students  (or  groups)  and 
have  them  compare  sample  runs  on  the 
same  data.  This  should  point  out 
difficulties  in  tasking. 

3)  Write  a  program  yourself  to  do 
one  of  the  assignments  given  above. 
Don't  think  carefully  about  all 
possible  orders  of  execution  of  the 
tasks.  Your  program  is  likely  to  have 
different  outputs  depending  on  the 
actual  physical  implementation  of  the 
tasks . 
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Abstract — A  considerable  amount  of  research  has  been  done  in  the  area  of  human-computer  interaction. 
High  quality  human-computer  interfaces  are  especially  important  when  a  program  needs  to  perform  in 
nearly  real-time.  In  this  paper,  a  particular  human-computer  interface  is  analyzed.  The  observations  made 
are  abstracted  and  generalized  to  problems  in  interfaces  for  proces  control.  Specific  benchmarks  for  evaluation 
of  interfaces  in  a  real-time  or  near  real-time  setting  are  developed.  We  show  that  these  benchmarks  are  more 
useful  than  general  performance  measures  of  computer  systems  for  — Hmating  computer  performance.  We 
develop  general  guidelines  for  designing  interfaces  which  satisfy  severe  constraints  and  determine  to 
what  extent  they  follow  generally  accepted  principles  for  interface  design. 


1.  INTRODUCTION 

A  frequent  use  of  computers  is  in  the  control  of  pro¬ 
cesses.  In  many  situations,  processes  that  were  con¬ 
trolled  by  a  group  of  individuals  are  now  controlled 
by  one  or  more  computers  with  human  operators  per¬ 
forming  a  monitoring  function.  The  operator  takes 
control  only  for  short  periods  of  time  for  testing  or 
when  an  emergency  occurs.  The  human-machine  in¬ 
terface  is  especially  important  in  such  a  situation. 
Poorly  designed  systems  produce  operator  boredom 
and  often  cause  operator  errors.  At  critical  times,  the 
response  time  of  a  poorly  designed  system  may  be  too 
slow  for  effective  process  control. 

There  is  a  large  and  growing  body  of  research  on 
human-computer  interfaces,  with  some  design  prin¬ 
ciples  beginning  to  emerge.  Much  of  the  research  in 
this  area  is  based  either  on  laboratory  experiments  or 
is  anecdotal  and  based  on  observations  of  adherence 
to  general  principles.  See  the  article  by  Foley,  Wallace 
and  Chan  [1]  for  an  excellent  survey  of  research  on 
the  psychology  of  human-computer  interaction  with 
emphasis  on  the  area  of  computer  graphics. 

Often  the  best  user  interfaces  require  extensive  use 
of  computer  graphics  displays.  Using  graphics  in  the 
interface  increases  the  load  on  the  computer  system 
and  requires  an  expenditure  for  hardware  that  ranges 
from  very  small  (personal  computers)  to  quite  large 
(high  performance  color  graphics  workstations).  Thus 
the  design  of  a  user  interface  must  take  into  account 
several  factors  such  as  availability  and  cost  of  graphics 
hardware,  demand  on  the  computing  system,  and  any 
requirement  for  real-time  performance. 

There  is  also  a  large  and  growing  body  of  research 
measuring  the  performance  of  computer  hardware  and 
software.  Results  in  this  area  are  often  results  in  the 
areas  of  computer  architecture,  analysis  of  algorithms, 
simulations,  reliability  models,  or  some  combination 
of  these  areas.  See  [2],  [3],  [4),  and  [5]  for  typical  results 
in  these  areas. 

In  Section  2,  we  will  briefly  discuss  the  evolution  of 
several  generations  of  a  particular  human-computer 
interface.  This  particular  interface  was  studied  by  the 
author  at  the  Goddard  Space  Flight  Center  of  NASA 
while  the  author  was  on  a  NAS  A/ AS  EE  Faculty  Fei- ' 


lowship.  The  results  are  extended  and  generalized 
greatly  in  the  rest  of  the  paper.  Particular  emphasis  is 
given  to  the  development  of  benchmarks  (Section  3) 
and  guidelines  for  evaluating  and  predicting  perfor¬ 
mance  (Section  4).  In  general,  we  show  bow  commonly 
accepted  principles  of  design  of  human-computer  in¬ 
terfaces  can  be  combined  with  specific  design  require¬ 
ments  without  significantly  degrading  performance  in 
many  instances. 

2.  A  TYPICAL  EXAMPLE 

A  control  room  at  the  Goddard  Space  Flight  Center 
of  NASA  is  responsible  for  the  control  and  operation 
of  many  spacecraft.  A  typical  spacecraft  sends  back 
information  to  ground  telemetry  stations  on  a  regular 
basis;  the  amount  and  type  of  information  varies  when 
certain  experiments  or  ground  tests  are  performed.  This 
telemetry  information  is  then  relayed  to  a  ground 
computer  system  which  consists  of  various  processors 
dedicated  to  recording  telemetry  data  sent  ba ck  by  a 
spacecraft,  executing  applications  programs  and  con¬ 
trolling  communications  interfaces  to  various  other 
devices.  Generally,  these  ground  computers  have  a 
moderately  heavy  steady-state  computing  load,  which 
is  well  understood  at  the  time  that  a  mission  is  planned. 
There  is  considerable  hardware  redundancy,  an  appli¬ 
cations  processor  is  kept  in  reserve  in  case  of  Mure. 
Because  of  the  design  requirements,  any  graphics  must 
be  done  locally,  with  little  or  no  load  on  the  applications 
computers.  Information  about  the  control  of  the 
spacecraft  must  be  available  to  the  control  system 
within  two  seconds,  with  real-time  response  desirable. 

Originally,  a  control  center  would  have  had  many 
monochrome  monitors  displaying  alphanumeric  in¬ 
formation.  Redundancy  was  provided  by  having  more 
than  one  person  look  at  the  same  information.  These 
displays  created  little  demand  on  system  resources. 
However,  training  of  operators  was  slow  and  there  was 
a  relatively  high  rate  of  error  and  fatigue,  even  for  ex¬ 
perienced  operators.  An  operator  must  monitor  many 
existing  experiments  and  must  initiate  others  in  order 
to  assure  that  the  spacecraft  is  functioning  properly. 

A  large  amount  of  human  factors  research  has  been 
applied  to  the  design  of  control  rooms.  In  particular. 
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the  use  of  graphics  displays  and  color  has  had  a  pro¬ 
nounced  effect  on  operator  training  and  efficiency.  The 
current  trend  in  control  rooms  is  to  provide  more  in¬ 
formation  to  fewer  operator/analysts  by  using  cdor 
computer  graphics  displays  on  CRTs.  See  [6]  and  [7] 
for  mote  information  about  the  evolution  and  design 
of  these  displays. 

Use  of  graphics  displays  in  control  room  setting  falls 
into  three  main  categories:  monitoring  of  processes, 
testing  and  control,  and  display  of  moving  objects. 
Monitoring  of  processes  includes  plotting  of  two  di¬ 
mensional  graphs.  Typical  graphs  are  plots  of  reserve 
levels  of  batteries  or  fuel  consumption  over  time. 
Monitoring  of  processes  also  includes  showing  config¬ 
urations  of  ground  computer  networks  and  availability 
of  various  computers.  This  type  of  activity  does  not 
require  state  of  the  art  graphics  equipment  The  prin¬ 
cipal  need  is  for  rapid  screen  updating  and  display  of 
stored  graphical  data.  Current  plans  are  to  use  IBM 
PC  ATs  with  special  graphics  cards  and  storage  devices 
for  these  displays  [8,  9]. 

Testing  and  control  involves  operator/analysts  de¬ 
ciding  which  tests  to  perform  on  a  satellite  or  spacecraft 
Control  rooms  for  current  flights  use  a  touch  screen 
on  a  CRT  display  of  a  “command  panel”  which  pre¬ 
sents  many  of  the  options  of  tests  to  be  run.  These 
command  panels  have  evolved  from  alphanumeric 
displays  in  which  an  operator  typed  in  the  test  that 
was  to  be  performed.  There  is  some  consideration  of 
saving  expensive  screen  “real  estate”  by  using  func¬ 
tional  tablets  which  have  some  of  their  commands 
preset 

The  major  issues  here  are  interface  technology,  ease 
of  learning,  and  ease  of  use  once  the  command  system 
has  been  mastered.  Touch  screen  technology  provides 
reasonably  fast  sensing  at  moderate  cost  (2S6  by  2S6 
touch  points  and  SO  points  per  second  conversion  rate 
is  a  typical  example  [10].)  Operator  satisfaction  with 
the  ease  of  learning  and  use  of  this  system  is  high. 
Anecdotal  evidence  that  a  touch  screen  is  preferable 
to  a  light  pen  agrees  with  the  findings  of  [1 1]  and  [12]. 
We  note  that  there  is  no  need  for  expensive  graphics 
processors  with  huge  display  list  memories  in  the  case 
of  testing  and  control  The  primary  need  is  for  input 
devices  and  relatively  inexpensive  graphics  worksta- 

tiC'ES. 

The  most  exacting  use  of  computer  graphics  in  a 
control  room  at  NASA  is  in  the  display  of  moving 
objects  such  as  the  space  shuttle.  The  simplest  way  to 
display  the  shuttle  involves  using  a  wire  frame  model 
with  no  removal  of  hidden  lines.  Adding  the  antennas, 
cargo  bay  doors,  and  robot  arms  complicates  the  pic¬ 
ture  so  that  hidden  lines  must  be  removed  in  order  to 
make  the  picture  useful  Allowing  for  three  dimensional 
rotation  means  that  each  vertex  in  the  model  of  the 
shuttle  will  require  the  multiplication  of  a  vector  by  a 
matrix.  For  a  reasonably  useful  wire  frame  drawing, 
this  will  use  up  approximately  one  half  of  the  cpu  time 
of  a  minicomputer  such  as  a  VAX  1 1/780,  which  is 
completely  unsatisfactory.  Using  color  graphics  rep¬ 
resentation  of  solid  objects  with  hidden  surface  removal 
and  some  shading  greatly  increases  the  computing  load. 


The  only  reasonable  solution  is  to  use  a  powerful 
graphics  workstation  with  hardware  matrix  operations 
and  many  of  the  necessary  algorithms  in  hardware. 
See  [2],  [3],  and  [13]  for  typical  estimates  of  computer 
load  caused  by  three  dimensional  rotation. 

3.  EVALUATION  OF  AN  INTERFACE;  BENCHMARKS 

In  Section  2  of  this  paper,  we  discussed  a  control 
center  as  an  grampi*  0f  a  user  interface  and  the  en¬ 
vironment  in  which  it  is  used.  The  evolution  of  user 
interfaces  in  this  setting  was  discussed.  We  now  turn 
our  attention  to  the  evaluation  of  the  performance  of 
general  interfaces  in  a  real  time  setting.  In  this  section, 
we  discuss  the  hardware  and  software  requirements  of 
interfaces,  with  emphasis  on  benchmark  programs.  Our 
primary  emphasis  is  on  evaluation  of  interfaces  which 
control  large  numbers  of  relatively  static  displays  for 
monitoring  of  processes  or  for  testing  and  control  In¬ 
terfaces  for  display  of  the  motion  of  complex  objects 
will  not  be  considered  in  this  paper. 

The  first  question  to  be  considered  is  how  we  will 
be  able  to  store  the  displays.  We  assume  that  the  logical 
organization  of  the  displays  is  hierarchical  with  rela¬ 
tively  few  levels  and  a  fairly  large  number  of  children 
possible  from  each  parent  node.  Results  in  [12],  [1], 
and  16]  suggest  that  this  is  the  most  effective  design  for 
the  menu  selection. 

We  note  that  there  are  only  a  few  possibilities  for 
the  storage  of  data  for  any  display:  as  collections  of 
pixels  in  either  compressed  or  uncompressed  form  or 
else  they  can  be  stored  as  collections  of  instructions  to 
a  display  processor.  The  various  displays  can  be  shown 
as  entire  screens,  portions  of  a  screen  in  a  window,  or 
on  several  screens.  Selections  can  be  made  from  menus 
that  range  from  fixed  and  permanent  as  in  the  case  of 
hard-coded  touch  tablets  to  relatively  fixed  such  as  in 
touch  panels  on  CRTs  and  finally  to  menus  that  are 
rarely  visible  as  in  pop-up  or  pull-down  menu  systems. 

Only  a  few  colors  are  needed  for  a  color  display  of 
monitoring  and  testing  of  processes.  For  example,  re¬ 
search  at  NASA  indicates  that  3  or  6  colors  suffice  for 
control  room  use  [6].  Thus  the  major  factors  in  using 
bit  mapped  displays  for  monitoring  and  testing  are 
screen  resolution,  size  of  primary  and  secondary 
memory,  speed  of  access  of  primary  or  secondary 
memory,  speed  of  screen  updating,  and  of  course  the 
nature  of  the  displays  and  the  way  they  are  stored. 
Much  of  this  information  can  be  obtained  from  man¬ 
ufacturer’s  specifications,  at  least  to  a  first  approxi¬ 
mation.  However,  the  performance  of  an  interface  also 
depends  on  the  speed  of  various  input  devices  and  the 
speed  and  ease  with  which  a  human  user  can  interact 
Thus  the  evaluation  of  the  speed  of  an  interface  in  a 
control  system  which  needs  to  approximate  real  time 
response  can  best  be  estimated  by  a  set  of  benchmark 
programs. 

To  aid  in  the  determination  of  actual  running  times 
of  the  various  portions  of  a  user  interface,  we  will  use 
a  notation  similar  to  but  not  identical  to  that  of  [4): 

c  “  time  to  draw  a  circle, 

p(j)  m  time  to  draw  a  polygon  with  j  sides  (j  >  1), 

t  *  time  to  draw  a  character. 
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any  of  the  above  symbols  followed  by  an  / denotes  the 
time  to  fill  the  object  (to  fill  a  character  means  to  fill 
the  smallest  rectangle  defining  the  character). 

Finally,  we  denote  the  time  to  write  a  segment  5  as 

TIMEOS)  -  sum  of  all  (c + p + 1 + c/+  p/+ 1  f) 

for  all  objects  in  S. 

In  [4],  formulas  were  presented  for  m«»«iiring  the 
total  time  to  display  all  possible  displays  in  a  large 
scale  system  for  monitoring  and  process  control  In 
that  paper,  we  were  interested  in  the  way  that  these 
times  are  modified  when  a  display  is  stored  as  a  segment 
in  a  structured  display  file,  has  to  be  searched  for  in  a 
tree,  is  in  secondary  storage,  or  needs  to  be  compiled. 
Our  goal  here  is  different,  since  we  are  interested  in 
the  interface  rather  than  the  speed  of  rewriting  displays. 
Thus  we  will  ignore  the  factors  DFTLE  (used  for  seg¬ 
ments  in  a  structured  display  file),  SECFILE  (for  sec¬ 
ondary  storage),  or  COMPILE  (for  compiling  a  seg¬ 
ment).  We  will  need  to  use  the  factors  LEVEL  rad 
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SEARCH  which  are  related  to  the  actual  selection  of 
a  display.  We  will  use  the  formula 

TIME  -  (LEVEL  +  1  )*SEARCH.TIME(5) 

to  represent  the  time  to  write  a  display  to  the  screen. 
Here  SEARCH  represents  the  time  for  searching  a 
menu  and  either  selecting  the  desired  object  for  viewing 
or  determining  that  the  object  is  not  present.  LEVEL 
will  serve  as  a  counter  for  the  length  of  the  path  from 
the  root  of  the  tree  to  the  menu  for  the  desired  display. 
By  convention,  the  length  of  a  path  from  the  root  to 
itself  is  0. 

These  benchmark  are  in  the  form  of  pseudocode 
programs  which  will  be  grouped  into  sets  of  three  as 
follows.  The  first  three  programs  show  the  speed  of 
replacing  a  screen  of  text  by  another  screen  of  text 
These  programs  will  use  a  procedure  TEXTSCREEN 
which  will  write  20  lines  of  40  characters  each.  The 
command  SYSTEMflTME)  will  provide  the  elapsed 
time  since  the  last  call  to  SYSTEMflTME).  The  next 
set  of  three  programs  will  be  obtained  from  the  first 
set  by  changing  graphics  instead  of  text  The  last  two 
sets  will  duplicate  the  first  two  sets  in  a  window  envi¬ 
ronment 


PROGRAM  REPLACE_TEXT, 

(•  TIME  FOR  UPDATING  OF  TEXT  PAGE  WITH  NO  INPUT  •) 
BEGIN 

TEXTSCREEN; 

OLDTIME SYSTEMflTME); 

FOR  I 1  TO  100  DO 
BEGIN 
CLEAR; 

TEXTSCREEN; 

END; 

NEWTIME  SYSTEMflTME); 

TEXT-TIME  (  NEWTIME  -  OLDTIME  )  /  100  ; 
RETURN(TEXT_TTME) ; 

END. 


PROGRAM  SEARCH_REPLACE_TEXT  ; 

(•  TIME  FOR  UPDATING  OF  TEXT  PAGE  USING  SELECTING  DEVICE  •) 

BEGIN 

TEXTSCREEN; 

OLDTIME SYSTEMflTME); 

FOR  I  :*  1  TO  100  DO 
BEGIN 

SEARCHMENU  ; 

CLEAR; 

DISPLAY-SELECTED-TEXTSCREEN; 

END; 

NEWTIME  :«  SYSTEMflTME)  -  OLDTIME  ; 

SEARCH  (  NEWTIME  -  OLDTIME  )  /  100  -  TEXT-TIME  ; 

RETURN(SEARCH); 

END. 

Here  SEARCHMENU  is  obtained  empirically  by  having  a  user  make  a  selection  from  the  appropriate  menus. 
We  expect  that  the  running  time  of  the  two  programs  given  above  will  be  very  close.  Any  large  difference 
indicates  that  the  effective  time  used  by  the  selection  process  is  large.  (We  are  assuming  that  the  looping  itself 
takes  negligible  time.  Obvious  modifications  can  be  made  if  this  is  not  the  case.) 
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PROGRAM  TREE  -SFARCH— REPLACE— TEXT  ; 

(•  TIME  FOR  UPDATING  TEXT  PAGE  WITH  SELECTING  DEVICE  AND 
SEARCHING  THE  TREE  OF  MENUS  •) 

BEGIN 

TEXTSCREEN; 

HEIGHT 10 ; 

OLDTIME  SYSTEM(TIME) ; 

FOR  I 1  TO  100  DO 
FOR  J  1  TO  HEIGHT  DO 
BEGIN 

SEARCHMENU; 

CLEAR; 

DISPLAY-SELECTED-TEXTSCREEN; 

CHANGE-LEVEL; 

END; 

NEWTIME  SYSTEM(TIME); 

LEVEL (  NEWTIME  -  OLDTIME  )  /  (100  •  HEIGHT  ) 

-  (  SEARCH  +  TEXT-TIME  )  /  HEIGHT  ; 

RETURN(LEVEL); 

END. 

A  call  to  the  procedure  CHANGE  LEVEL  involves  the  changing  of  menus  to  a  menu  for  a  new  level  of  the 
menu  tree. 

The  next  set  of  three  programs  determines  a  typical  time  for  updating  graphics  displays.  These  programs  are 
obtained  from  the  first  set  by  replacing  “TEXTSCREEN'’  by  “GRAPHICS— SCREEN";  the  details  will  be 
omitted. 

The  remaining  two  sets  of  three  benchmark  programs  will  measure  the  efficiency  of  interfaces  in  a  window 
environment  Basic  concerns  here  are  measurement  of  the  behavior  of  the  window  updating  system  and  of  the 
selection  devices  when  the  system  is  under  heavy  load.  Concerns  about  stationary,  pop-up,  and  pull-down 
menus;  screen  layout;  and  shortcuts  through  the  tree  structure  hierarchy  to  frequently  used  menu  items  will  be 
postponed  to  Section  4. 

As  before,  we  will  present  the  programs  for  the  updating  of  text  windows  and  merely  indicate  the  modification 
necessary  for  having  graphics  displays  in  windows. 

PROGRAM  REPLACE— WINDOW_TEXT ; 

(•  UPDATE  TEXT  DISPLAYS  IN  VARIOUS  WINDOWS  .) 

BEGIN 

GET— WINDOW-ENVIRONMENT ; 

SELECT-WINDOW; 

CLEAR; 

TEXTSCREEN; 

OLDTIME :«  SYSTEM(TIME); 

FOR  I 1  TO  100  DO 
BEGIN 

SELECT-WINDOW; 

CLEAR; 

TEXTSCREEN; 

END; 

NEWTIME SYSTEM(TTME); 

WINDOW— TEXT— TIME  (  NEWTIME  -  OLDTIME  )  /  100  ; 

RETURN  (WINDOW-TEXT-TIME); 

END. 

PROGRAM  SEARCH_REPLACE_WINDOW_TEXT; 

(•  USE  SELECTION  DEVICE;  REPLACE  TEXT  IN  WINDOW  •) 

BEGIN 

GET_WINDOW_ENVIRONMENT  ; 

SELECT-WINDOW ; 

CLEAR; 

TEXTSCREEN  ; 

OLDTIME SYSTEM(TIME); 

FOR  I  :*  1  TO  100  DO 
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BEGIN 
SEARCH; 

SELECT-WINDOW ; 

CLEAR  ; 

TEXTSCREEN  ; 

END; 

NEWTIME  SYSTEM  (TIME) ; 

WINDOW-SEARCH  (NEWTIME  -  OLDTIME)  /  100  -  WINDOW_TEXT_TIME  ; 

RETURN  (WINDOW-SEARCH) ; 

END. 

PROGRAM  TREE_SEARCH_REPLACE_ WINDOW-TEXT  ; 

(•  CHANGE  LEVELS  OF  MENU  SELECTION  TREE  ;  SELECT  MENU  ITEM  ; 

REPLACE  TEXT  SCREEN  IN  A  WINDOW  ENVIRONMENT  •) 

BEGIN 

GET_WINDOW_ENVIRONMENT  ; 

SELECT-WINDOW; 

CLEAR; 

TEXTSCREEN; 

HEIGHT 10 ; 

OLDIIME  :•  SYSTEMfTIME); 

FOR  I 1  TO  100  DO 
FOR  J  1  TO  HEIGHT  DO 
BEGIN 
SEARCH; 

SELECT-WINDOW; 

CLEAR; 

TEXTSCREEN; 

CHANGE-LEVEL; 

END; 

NEWTIME SYSTEMfTIME); 

WINDOW-LEVEL  ( NEWTIME  -  OLDTIME  )  /  (  100  •  HEIGHT ) 

I  -  (  WINDOW-SEARCH  +  WINDOW-TEXT-TIME  )  /  HEIGHT; 

RETURN  (WINDOW_LEVEL) ; 

,  END. 


The  remaining  three  programs  are  obtained 
g  by  replacing  “TEXTSCREEN”  by  “GRAPHICS- 
■  SCREEN.”  These  programs  suggest  how  a  set  of 

I  benchmarks  can  be  written  for  the  evaluation  of  the 

real-time  performance  of  an  interface.  As  such,  our 
viewpoint  is  different  from  that  of  [5]  where  the  main 
concern  is  the  evaluation  of  overall  window  updating 
speed,  rather  than  the  selection  process. 

These  benchmark  programs  produce  the  values  of 
i  SEARCH  and  LEVEL  as  described  above.  They  also 

give  the  values  of  WINDOW-SEARCH  and  WIN¬ 
DOW-LEVEL,  which  describe  the  same  quantities  in 
a  window  environment.  We  will  use  these  values  in 
Section  4. 

4.  EVALUATION  OF  AN  INTERFACE;  ANALYSIS 
’  We  now  use  the  quantities  SEARCH,  LEVEL, 

WINDOW-SEARCH,  and  WINDOW-LEVEL  to 
analyze  the  performance  of  interfaces  for  monitoring 
of  processes  and  for  testing  and  control.  Since  we  are 
considering  only  the  case  of  interfaces  in  a  real-time 
environment,  we  assume  the  existence  of  an  external 
>  quantity  called  ABSOLUTE-TIME— LIMIT  which 

represents  the  maximum  time  that  is  available  for  all 
of  the  computer  operations  including  the  action  of  the 
interface  and  the  retrieval  and  display  of  data.  Since 


the  design  of  the  interface  is  logically  independent  of 
the  design  of  the  data  storage  and  retrieval  algorithms, 
we  restrict  our  attention  to  a  constant  MAX— TIME 
which  is  the  maximum  time  that  any  response  of  the 
interface  can  have.  Clearly, 

MAX-TIME  +  RETRIEVAL-TIME 

-  ABSOLUTE-TIME-LIMIT. 


If 

RETRIEVAL-TIME 

>-  ABSOLUTE-TIME-LIMIT, 

then  no  interface  is  possible  that  meets  the  required 
conditions  and  the  hardware  and  software  require¬ 
ments  for  the  project  must  be  changed,  From  this  point 
on,  we  assume  that  some  interface  is  possible  within 
the  constraint  of  MAX— TIME.  We  also  assume; 

1.  Some  portion  of  the  interface  requires  graphics. 

2.  The  chosen  display  hardware  and  software  permit 
mixed  text  and  graphics  with  a  variety  of  input  devices. 
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3.  That  this  mixture  is  available  both  within  a  win¬ 
dow  management  system  and  outside  it 

4.  SEARCH  <  WINDOW_SEARCH. 

5.  LEVEL  <  WINDOW_LEVEL. 

Assume  that  there  is  a  total  of  M  options  available 
from  menus  and  that  the  menus  are  arranged  hierar¬ 
chically.  Then  the  two  extremes  of  each  node  of  the 
tree  (except  the  last)  having  one  child  and  the  root 
having  M  -1  children  correspond  to  times  of  Mo 
LEVEL  and  M  •  SEARCH,  respectively.  If  each  of 
these  numbers  is  smaller  than  MAX-TIME,  then  every 
arrangement  of  menus  is  feasible  from  a  real-time  per¬ 
formance  viewpoint  and  only  human  factors  consid¬ 
erations  need  be  considered.  A  similar  statement  holds 
for  WINDOW_LEVEL  and  WINDOW_SEARCH. 
We  thus  restrict  our  attention  to  the  case  that  at  least 
one  of 

Mo  LEVEL >  MAX-TIME,  (l) 

Mo  SEARCH  >  MAX-TIME,  (2) 

Mo  WINDOW— LEVEL  >  MAX— TIME,  (3) 
Mo  WINDOW— SEARCH  >  MAX— TIME  (4) 

is  true.  Note  that  we  make  no  assumption  about  which 
of  LEVEL  and  SEARCH  or  which  of  WINDOW 
-LEVEL  and  WINDOW-SEARCH  is  larger. 

There  are  several  alternatives  available  to  improve 
the  real-time  performance  of  the  interface. 

1.  If  eqn  (1)  holds  and  eqn  (2)  does  not,  then  in¬ 
creasing  the  degree  of  each  node  at  the  cost  of  increasing 
SEARCH  will  speed  up  the  worst  case  performance. 

2.  If  eqn  (2)  holds  but  eqn  (1)  does  not,  then  de¬ 
creasing  the  degree  of  each  node  at  the  cost  of  increasing 
LEVEL  will  improve  worst  case  performance. 

3.  If  eqn  (1)  and  eqn  (2)  both  bold,  then  the  only 
inexpensive  solution  is  to  use  windowing.  Clearly  eqn 
(3)  and  eqn  (4)  are  also  tree  as  stated.  However,  in  this 
case  we  note  that  many  different  choices  may  be  dis¬ 
played  in  windows  and  thus  (1)  and  (2)  should  be  re¬ 
placed  by 

Mo  WINDOW-LEVEL  / 

NUMBER_OF_ WINDOWS  <  MAX-TIME,  (3a) 

Mo  WINDOW_SEARCH  / 

NUMBER-OF  WINDOWS  <  MAX-TIME.  (4a) 

We  apply  the  same  analyses  used  in  cases  1  and  2 
in  this  situation. 

4.  In  each  of  cases  1-3,  we  assumed  that  there  were 
no  shortcuts  available  in  the  hierarchical  menu  struc¬ 
ture.  However,  many  selection  devices  (such  as  a  three 
button  mouse)  allow  for  omitting  several  intermediate 
levels  of  a  tree.  This  should  be  used  for  experienced 
users  to  increase  operator  satisfaction  and  to  increase 
speed  of  the  interface.  This  was  pointed  out  in  [1].  We 


also  suggest  that  only  stationary  or  pull-down  menus 
be  used  for  clarity. 

5.  Not  all  operations  in  the  monitoring  of  processes 
or  testing  and  control  need  be  carried  out  in  real-time. 
Those  operations  should  be  on  the  bottom  level  of  the 
tree  so  as  to  allow  faster  traversals  of  the  tree  in  most 
cases. 

6.  Use  an  additional  human  operator  to  reduce  the 
load  on  the  interface  by  a  factor  of  2. 

7.  Revise  the  underlying  hardware  and  software. 

These  guidelines  should  not  be  considered  complete 

but  instead  as  suggestions  for  actions  when  an  interface 
for  real-time  activity  is  predicted  to  be  overloaded. 

S.  SUMMARY 

In  this  note,  we  have  described  an  example  of  a  user 
interface  for  monitoring  of  processes;  testing  and  con¬ 
trol;  and  display  of  three  dimensional  motion  and  how 
it  evolved  over  time.  The  problems  encountered  there 
were  abstracted  and  generalized  in  order  to  develop  a 
general  model  for  the  analysis  of  the  performance  of 
human  computer  interfaces  for  programs  which  are 
used  for  monitoring  of  processes  and  for  testing  and 
control.  Emphasis  was  given  to  specific  benchmarks 
and  guidelines  for  the  design  and  evaluation  of  “real¬ 
time"  interfaces,  rather  than  the  general  performance 
measurements  often  used  [3]. 
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